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Compounds and methods for inducing protective 
immunity against tuberculosis are disclosed. The com* 
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Description 

COMPOUNDS AND METHODS FOR IMMUNOTHERAPY 
AND DIAGNOSIS OF TUBERCULOSIS 

Technical Field 

The present invention relates generally to detecting, treating and 
preventing Mycobacterium tuberculosis infection. The invention is more particularly 
related to polypeptides comprising a Mycobacterium tuberculosis antigen, or a portion 
or other variant thereof, and the use of such polypeptides for diagnosing and vaccinating 
against Mycobacterium tuberculosis infection. 

Background of the Invention 

Tuberculosis is a chronic, infectious disease, that is generally caused by 
infection with Mycobacterium tuberculosis. It is a major disease in developing 
countries, as well as an increasing problem in developed areas of the world, with about 
8 million new cases and 3 million deaths each year. Although the infection may be 
asymptomatic for a considerable period of time, the disease is most commonly 
manifested as an acute inflammation of the lungs, resulting in fever and a nonproductive 
cough. If left untreated, serious complications and death typically result 

Although tuberculosis can generally be controlled using extended 
antibiotic therapy, such treatment is not sufficient to prevent the spread of the disease. 
Infected individuals may be asymptomatic, but contagious, for some time. In addition, 
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although compliance with the treatment regimen is critical, patient behavior is difficult 
to monitor. Some patients do not complete the course of treatment, which can lead to 
ineffective treatment and the development of drug resistance. 

Inhibiting the spread of tuberculosis requires effective vaccination and 
5 accurate, early diagnosis of the disease. Currently, vaccination with live bacteria is the 
most efficient method for inducing protective immunity. The most common 
Mycobacterium employed for this purpose is Bacillus Calmette-Guerin (BCG), an 
avirulent strain of Mycobacterium bovis. However, the safety and efficacy of BCG is a 
source of controversy and some countries, such as the United States, do not vaccinate 
10 the general public. Diagnosis is commonly achieved using a skin test, which involves 
intradermal exposure to tuberculin PPD (protein-purified derivative). Antigen-specific 
T cell responses result in measurable induration at the injection site by 48-72 hours after 
injection, which indicates exposure to Mycobacterial antigens. Sensitivity and 
specificity have, however, been a problem with this test, and individuals vaccinated 
1 5 with BCG cannot be distinguished from infected individuals. 

While macrophages have been shown to act as the principal effectors of 
M tuberculosis immunity, T cells are the predominant inducers of such immunity. The 
essential role of T cells in protection against M. tuberculosis infection is illustrated by 
the frequent occurrence of M tuberculosis in AIDS patients, due to the depletion of 
20 CD4 T cells associated with human immunodeficiency virus (HIV) infection. 
Mycobacterium-reactive CD4 T cells have been shown to be potent producers of 
gamma-interferon (IFN-y), which, in turn, has been shown to trigger the anti- 
mycobacterial effects of macrophages in mice. While the role of IFN-y in humans is 
less clear, studies have shown that 1,25-dihydroxy-vitamin D3, either alone or in 
25 combination with IFN-y or tumor necrosis factor-alpha, activates human macrophages 
to inhibit M. tuberculosis infection. Furthermore, it is known that IFN-y stimulates 
human macrophages to make 1,25-dihydroxy-vitamin D3. Similarly, IL-12 has been 
shown to play a role in stimulating resistance to M tuberculosis infection. For a review 
of the immunology of M. tuberculosis infection see Chan and Kaufmann in 
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Tuberculosis: Pathogenesis, Protection and Control, Bloom (ed), ASM Press, 
Washington, DC, 1994. 

Accordingly, there is a need in the art for improved vaccines and 
methods for preventing, treating and detecting tuberculosis. The present invention 
5 fulfills these needs and further provides other related advantages. 

Summary of the Invention 

Briefly stated, this invention provides compounds and methods for 
preventing and diagnosing tuberculosis. In one aspect, polypeptides are provided 
10 comprising an immunogenic portion of a soluble M. tuberculosis antigen, or a variant of 
such an antigen that differs only in conservative substitutions and/or modifications. In 
one embodiment of this aspect, the soluble antigen has one of the following N-terminal 
sequences: 

(a) Asp-Pro-Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Cys-Asn-Tyr-Gly- 
1 5 Gin-Val-Val-Ala-Ala-Leu; (SEQ ID No. 120) 

(b) Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Thr-Pro-Ala-Pro- 
Ser;(SEQIDNo. 121) 

(c) Ala-Ala-Met-Lys-Pro-Arg-Thr-GIy-Asp-Gly-Pro-Leu-Glu-Ala- 
Ala-Lys-Glu-Gly-Arg; (SEQ ID No. 122) 

20 (d) Tyr-Tyr-Trp-Cys-Pro-Gly-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly- 

Pro; (SEQ ID No. 123) 

(e) Asp-Ile-Gly-Ser-Glu-Ser-Thr-Glu-Asp-Gln-Gln-Xaa-Ala-Val; 
(SEQ ID No. 124) 

(f) Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro; (SEQ ID 
25 No. 125) 

(g) Asp-Pro-GIu-Pro-AIa-Pro-Pro-Val-Pro-Thr-Thr-Ala-Ala-Ser- 
Pro-Pro-Ser; (SEQ ID No. 126) 

(h) Ala-Pro-Lys-Thr-Tyr-Xaa^ 
Gly;(SEQ ID No. 127) 
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(i) Asp-Pro-Ala-Ser-Ala-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Leu- 

Thr-Ser-Leu-Leu-Asn-Ser-Leu-Ala-Asp-Pro-Asn-Val-Ser-Phe- 
AJa-Asn; (SEQ ID No. 128) 

(j) Xaa-Asp-Ser-Glu-Lys-Ser-Ala-Thr-Ile-Lys-Val-Thr-Asp-Ala- 
5 Ser; (SEQ ID No. 134) 

(k) Ala-Gly-Asp-Thr-Xaa-Ile-Tyr-Ile-Val-Gly-Asn-Leu-Thr-AIa- 
Asp; (SEQ ID No. 135) or 

(1) ^a-Pro-Glu-Ser-Gly-Ala-Gly-Leu-Gly-Gly-Thr-Val-Gln-Ala- 
Gly; (SEQ ID No. 136) 
1 0 wherein Xaa may be any amino acid. 

In a related aspect, polypeptides are provided comprising an 
immunogenic portion of an M. tuberculosis antigen, or a variant of such an antigen that 
differs only in conservative substitutions and/or modifications, the antigen having one 
of the following N-terminal sequences: 

15 (m > ^-Tyr-^e-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-Ile-Val-Pro-Gly-Lys- 

Ue-Asn-Val-His-Leu-Val; (SEQ ID No. 137) or 

(n) Asp-Pro-Pro-Asr>Pro-His-Gln-Xaa-Asp-Met-Thr-Lys-Gly-Tyr- 
Tyr-Pro-Gly-Gly-Arg-Arg-Xaa-Phe; (SEQ ID No. 129) 
wherein Xaa may be any amino acid. 

20 In another embodiment, the antigen comprises an amino acid sequence 

encoded by a DNA sequence selected from the group consisting of the sequences 
recited in SEQ ID Nos, 1, 2, 4-10, 13-25, 52, 99 and 101, the complements of said 
sequences, and DNA sequences that hybridize to a sequence recited in SEQ ID Nos.: 1, 
2, 4-10, 13-25, 52, 99 and 101 or a complement thereof under moderately stringent 

25 conditions. 

In a related aspect, the polypeptides comprise an immunogenic portion 
of a M. tuberculosis antigen, or a variant of such an antigen that differs only in 
conservative substitutions and/or modifications, wherein the antigen comprises an 
amino acid sequence encoded by a DNA sequence selected from the group consisting of 
30 the sequences recited in SEQ ID Nos.: 26-51, the complements of said sequences, and 
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DNA sequences that hybridize to a sequence recited in SEQ ID Nos.: 26-51 or a 
complement thereof under moderately stringent conditions. 

In related aspects, DNA sequences encoding the above polypeptides, 
expression vectors comprising these DNA sequences and host cells transformed or 
5 transfected with such expression vectors are also provided. 

In another aspect, the present invention provides fusion proteins 
comprising a first and a second inventive polypeptide or, alternatively, an inventive 
polypeptide and a known M. tuberculosis antigen. 

Within other aspects, the present invention provides pharmaceutical 
10 compositions that comprise one or more of the above polypeptides, or a DNA molecule 
encoding such polypeptides, and a physiologically acceptable carrier. The invention 
also provides vaccines comprising one or more of the polypeptides as described above 
and a non-specific immune response enhancer, together with vaccines comprising one 
or more DNA sequences encoding such polypeptides and a non-specific immune 
15 response enhancer. 

In yet another aspect, methods are provided for inducing protective 
immunity in a patient, comprising administering to a patient an effective amount of one 
or more of the above polypeptides. 

In further aspects of this invention, methods and diagnostic kits are 
20 provided for detecting tuberculosis in a patient. The methods comprise contacting 
dermal cells of a patient with one or more of the above polypeptides and detecting an 
immune response on the patient's skin. The diagnostic kits comprise one or more of the 
above polypeptides in combination with an apparatus sufficient to contact the 
polypeptide with the dermal cells of a patient. 
25 These and other aspects of the present invention will become apparent 

upon reference to the following detailed description and attached drawings. All 
references disclosed herein are hereby incorporated by reference in their entirety as if 
each was incorporated individually. 



o o 

WO 97/09428 PCT/US96/14674 



Brief De scription of the Drawings and Sequence Identifiers 

Figure 1A and B illustrate the stimulation of proliferation and interferon- 
y production in T cells derived from a first and a second M. tuberculosis-immune donor, 
respectively, by the 14 Kd, 20 Kd and 26 Kd antigens described in Example 1. 

Figure 2 illustrates the stimulation of proliferation and interferon-y 
production in T cells derived from an M. tuberculosis-immune individual by the two 
representative polypeptides TbRa3 and TbRa9. 



SEQ. ID NO. 1 is the DNA sequence of TbRal . 

SEQ. ID NO. 2 is the DNA sequence of TbRal 0. 

SEQ. ID NO. 3 is the DNA sequence of TbRal 1. 

SEQ. ID NO. 4 is the DNA sequence of TbRal 2. 

SEQ. ID NO. 5 is the DNA sequence of TbRal 3. 

SEQ. ID NO. 6 is the DNA sequence of TbRal 6. 

SEQ. ID NO. 7 is the DNA sequence of TbRal 7. 

SEQ. ID NO. 8 is the DNA sequence of TbRal 8. 

SEQ. ID NO. 9 is the DNA sequence of TbRal 9. 

SEQ. ID NO. 10 is the DNA sequence of TbRa24. 

SEQ. ID NO. 1 1 is the DNA sequence of TbRa26. 

SEQ. ID NO. 12 is the DNA sequence of TbRa28. 

SEQ. ID NO. 13 is the DNA sequence of TbRa29. 
SEQ. ID NO. 14 is the DNA sequence of TbRa2A. 
SEQ. ID NO. 15 is the DNA sequence of TbRa3. 
SEQ. ID NO. 16 is the DNA sequence of TbRa32. 
SEQ. ID NO. 17 is the DNA sequence of TbRa35. 
SEQ. ID NO. 18 is the DNA sequence of TbRa36. 
SEQ. ID NO. 19 is the DNA sequence of TbRa4. 
SEQ. ID NO. 20 is the DNA sequence of TbRa9. 
SEQ. ID NO. 21 is the DNA sequence of TbRaB. 
SEQ. ID NO. 22 is the DNA sequence of TbRaC. 



SEQ. ID NO. 23 is the DNA sequence of TbRaD. 
SEQ. ID NO. 24 is the DNA sequence of YYWCPG. 
SEQ. ID NO. 25 is the DNA sequence of AAMK. 
SEQ. ID NO. 26 is the DNA sequence of TbL-23. 
SEQ. ID NO. 27 is the DNA sequence of TbL-24. 
SEQ. ID NO. 28 is the DNA sequence of TbL-25. 
SEQ. ID NO. 29 is the DNA sequence of TbL-28. 
SEQ. ID NO. 30 is the DNA sequence of TbL-29. 
SEQ. ID NO. 31 is the DNA sequence of TbH-5. 
SEQ. ID NO. 32 is the DNA sequence of TbH-8. 
SEQ. ID NO. 33 is the DNA sequence of TbH-9. 
SEQ. ID NO. 34 is the DNA sequence of TbM-1. 
SEQ. ID NO. 35 is the DNA sequence of TbM-3. 
SEQ. ID NO. 36 is the DNA sequence of TbM-6. 
SEQ. ID NO. 37 is the DNA sequence of TbM-7. 
SEQ. ID NO. 38 is the DNA sequence of TbM-9. 
SEQ. ID NO. 39 is the DNA sequence of TbM-12. 
SEQ. ID NO. 40 is the DNA sequence of TbM-13. 
SEQ. ID NO. 41 is the DNA sequence of TbM-14. 
SEQ. ID NO. 42 is the DNA sequence of TbM-I5. 
SEQ. ID NO. 43 is the DNA sequence of TbH-4. 
SEQ. ID NO. 44 is the DNA sequence of TbH-4-FWD. 
SEQ. ID NO. 45 is the DNA sequence of TbH-12. 
SEQ. ID NO. 46 is the DNA sequence of Tb38-1. 
SEQ. ID NO. 47 is the DNA sequence of Tb38-4. 
SEQ. ID NO. 48 is the DNA sequence of TbL-17. 
SEQ. ID NO. 49 is the DNA sequence of TbL-20. 
SEQ. ID NO. 50 is the DNA sequence of TbL-21. 
SEQ. ID NO. 51 is the DNA sequence of TbH-16. 
SEQ. ED NO. 52 is the DNA sequence of DPEP. 
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SEQ. ID NO. 53 is the deduced amino acid sequence of DPEP. 

SEQ. ID NO. 54 is the protein sequence of DPV N-tenninal Antigen. 

SEQ. ID NO. 55 is the protein sequence of AVGS N-tenninal Antigen. 

SEQ. ID NO. 56 is the protein sequence of AAMK N-terminal Antigen. 
5 SEQ. ID NO. 57 is the protein sequence of YYWC N-terminal Antigen. 

SEQ. ID NO. 58 is the protein sequence of DIGS N-terminal Antigen. 

SEQ. ID NO. 59 is the protein sequence of AEES N-terminal Antigen. 

SEQ. ID NO. 60 is the protein sequence of DPEP N-teiminal Antigen. 

SEQ. ID NO. 61 is the protein sequence of APKT N-terminal Antigen. 
10 SEQ. ID NO. 62 is the protein sequence of DPAS N-terminal Antigen. 

SEQ. ID NO. 63 is the deduced amino acid sequence of TbRal. 

SEQ. ID NO. 64 is the deduced amino acid sequence of TbRal 0. 

SEQ. ID NO. 65 is the deduced amino acid sequence of TbRal 1. 

SEQ. ID NO. 66 is the deduced amino acid sequence of TbRal2. 
15 SEQ. ID NO. 67 is the deduced amino acid sequence of TbRal 3. 

SEQ. ID NO. 68 is the deduced amino acid sequence of TbRal 6. 
SEQ. ID NO. 69 is the deduced amino acid sequence of TbRal 7. 
SEQ. ID NO. 70 is the deduced amino acid sequence of TbRal 8. 
SEQ. ID NO. 71 is the deduced amino acid sequence of TbRal 9. 
20 SEQ. ID NO. 72 is the deduced amino acid sequence of TbRa24. 

SEQ. ID NO. 73 is the deduced amino acid sequence of TbRa26. 
SEQ. ID NO. 74 is the deduced amino acid sequence of TbRa28. 
SEQ. ID NO. 75 is the deduced amino acid sequence of TbRa29. 
SEQ. ID NO. 76 is the deduced amino acid sequence of TbRa2A. 
25 SEQ. ID NO. 77 is the deduced amino acid sequence of TbRa3. 

SEQ. ID NO. 78 is the deduced amino acid sequence of TbRa32. 
SEQ. ID NO. 79 is the deduced amino acid sequence of TbRa35. 
SEQ. ID NO. 80 is the deduced amino acid sequence of TbRa36. 
SEQ. ID NO. 81 is the deduced amino acid sequence of TbRa4. 
30 SEQ. ID NO. 82 is the deduced amino acid sequence of TbRa9. 



SEQ. ID NO. 83 is the deduced amino acid sequence of TbRaB. 

SEQ. ID NO. 84 is the deduced amino acid sequence of TbRaC. 

SEQ. ID NO. 85 is the deduced amino acid sequence of TbRaD. 

SEQ. ID NO. 86 is the deduced amino acid sequence of YYWCPG. 

SEQ. ID NO. 87 is the deduced amino acid sequence of TbAAMK. 

SEQ. ID NO. 88 is the deduced amino acid sequence of Tb38-1. 

SEQ. ID NO. 89 is the deduced amino acid sequence of TbH-4. 

SEQ. ID NO. 90 is the deduced amino acid sequerice of TbH-8. 

SEQ. ID NO. 91 is the deduced amino acid sequence of TbH-9. 

SEQ. ID NO. 92 is the deduced amino acid sequence of TbH-12. 

SEQ. ID NO. 93 is the amino acid sequence of Tb38-1 Peptide 1. 

SEQ. ID NO. 94 is the amino acid sequence of Tb38-1 Peptide 2. 

SEQ. ID NO. 95 is the amino acid sequence of Tb38-1 Peptide 3. 

SEQ. ID NO. 96 is the amino acid sequence of Tb38-1 Peptide 4. 

SEQ. ID NO. 97 is the amino acid sequence of Tb38-1 Peptide 5. 

SEQ. ID NO. 98 is the amino acid sequence of Tb38-1 Peptide 6. 

SEQ. ID NO. 99 is the DNA sequence of DPAS. 

SEQ. ID NO. 100 is the deduced amino acid sequence of DPAS. 

SEQ. ID NO. 101 is the DNA sequence of DPV. 

SEQ. ID NO. 102 is the deduced amino acid sequence of DPV. 

SEQ. ID NO. 103 is the DNA sequence of ESAT-6. 

SEQ. ID NO. 104 is the deduced amino acid sequence of ESAT-6. 

SEQ. ID NO. 105 is the DNA sequence of TbH-8-2. 

SEQ. ID NO. 106 is the DNA sequence of TbH-9FL. 

SEQ. ID NO. 107 is the deduced amino acid sequence of TbH-9FL. 

SEQ. ID NO. 108 is the DNA sequence of TbH-9- 1. 

SEQ. ID NO. 109 is the deduced amino acid sequence of TbH-9- 1. 

SEQ. ID NO. 1 10 is the DNA sequence of TbH-9-4. 

SEQ. ID NO. 1 1 1 is the deduced amino acid sequence of TbH-9-4. 

SEQ. ID NO. 1 12 is the DNA sequence of Tb38-1F2 IN. 



10 
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SEQ. ID NO. 1 13 is the DNA sequence of Tb38-2F2 RP. 
SEQ. ID NO. 1 14 is the deduced amino acid sequence of Tb37-FL. 
SEQ. ID NO. 1 15 is the deduced amino acid sequence of Tb38-IN. 
SEQ. ID NO. 1 16 is the DNA sequence of Tb38-1F3. 
SEQ. ID NO. 1 17 is the deduced amino acid sequence of Tb38-1F3. 
SEQ. ID NO. 1 1 8 is the DNA sequence of Tb38-1F5. 
SEQ. ID NO. 1 19 is the DNA sequence of Tb38-1F6. 
SEQ. ID NO. 120 is the deduced N-terminal amino acid sequence of DPV. 
SEQ. ID NO. 121 is the deduced N-terminal amino acid sequence of AVGS. 
SEQ. ID NO. 122 is the deduced N-terminal amino acid sequence of AAMK. 
SEQ. ID NO. 123 is the deduced N-terminal amino acid sequence of YYWC. 
SEQ. ID NO. 124 is the deduced N-terminal amino acid sequence of DIGS. 
SEQ. ID NO. 125 is the deduced N-terminal amino acid sequence of AEES. 
SEQ. ID NO. 126 is the deduced N-terminal amino acid sequence of DPEP. 
15 SE Q- ID NO. 127 is the deduced N-terminal amino acid sequence of APKT. 

SEQ. ID NO. 128 is the deduced amino acid sequence of DP AS. 
SEQ. ID NO. 129 is the protein sequence of DPPD N-terminal Antigen. 
SEQ ID NO. 130-133 are the protein sequences of four DPPD cyanogen 
bromide fragments. 

20 SEQ ID NO. 134 is the N-terminal protein sequence of XDS antigen. 

SEQ ID NO. 135 is the N-terminal protein sequence of AGD antigen. 
SEQ ID NO. 136 is the N-terminal protein sequence of APE antigen. 
SEQ ID NO. 137 is the N-terminal protein sequence of XYI antigen. 

25 Detailed D escription of the Invention 

As noted above, the present invention is generally directed to 
compositions and methods for preventing, treating and diagnosing tuberculosis. The 
compositions of the subject invention include polypeptides that comprise at least one 
immunogenic portion of a M. tuberculosis antigen, or a variant of such an antigen that 

30 differs only in conservative substitutions and/or modifications. Polypeptides within the 
scope of the present invention include, but are not limited to, immunogenic soluble 
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M. tuberculosis antigens. A "soluble M. tuberculosis antigen" is a protein of 
M tuberculosis origin that is present in M. tuberculosis culture filtrate. As used herein, 
the term "polypeptide" encompasses amino acid chains of any length, including full 
length proteins (/.<>., antigens), wherein the amino acid residues are linked by covalent 
peptide bonds. Thus, a polypeptide comprising an immunogenic portion of one of the 
above antigens may consist entirely of the immunogenic portion, or may contain 
additional sequences. The additional sequences may be derived from the native 
M tuberculosis antigen or may be heterologous, and such sequences may (but need not) 
be immunogenic. 

"Immunogenic," as used herein, refers to the ability to elicit an immune 
response (e.g., cellular) in a patient, such as a human, and/or in a biological sample. In 
particular, antigens that are immunogenic (and immunogenic portions or other variants 
of such antigens) are capable of stimulating cell proliferation, interleukin-12 production 
and/or interferon-y production in biological samples comprising one or more cells 
15 selected from the group of T cells, NK cells, B cells and macrophages, where the cells 
are derived from an M. tuberculosis-immune individual. Polypeptides comprising at 
least an immunogenic portion of one or more M. tuberculosis antigens may generally be 
used to detect tuberculosis or to induce protective immunity against tuberculosis in a 
patient. 

20 Th e compositions and methods of this invention also encompass variants 

of the above polypeptides. A "variant," as used herein, is a polypeptide that differs 
from the native antigen only in conservative substitutions and/or modifications, such 
that the ability of the polypeptide to induce an immune response is retained. Such 
variants may generally be identified by modifying one of the above polypeptide 

25 sequences, and evaluating the immunogenic properties of the modified polypeptide 
using, for example, the representative procedures described herein. 

A "conservative substitution" is one in which an amino acid is 
substituted for another amino acid that has similar properties, such that one skilled in 
the art of peptide chemistry would expect the secondary structure and hydropathic 

30 nature of the polypeptide to be substantially unchanged. In general, the following 
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groups of amino acids represent conservative changes: (1) ala, pro, gly, glu, asp, gin, 
asn, ser, thr, (2) cys, ser, tyr, thr, (3) val, ile, leu, met, ala, phe; (4) Iys, arg, his; and 
(5) phe, tyr, tip, his. 

Variants may also (or alternatively) be modified by, for example, the 
5 deletion or addition of amino acids that have minimal influence on the immunogenic 
properties, secondary structure and hydropathic nature of the polypeptide. For example, 
a polypeptide may be conjugated to a signal (or leader) sequence at the N-terminal end 
of the protein which co-translationally or post-translationally directs transfer of the 
protein. The polypeptide may also be conjugated to a linker or other sequence for ease 
10 of synthesis, purification or identification of the polypeptide {e.g., poly-His), or to 
enhance binding of the polypeptide to a solid support. For example, a polypeptide may 
be conjugated to an immunoglobulin Fc region. 

In a related aspect, combination polypeptides are disclosed. A 
"combination polypeptide" is a polypeptide comprising at least one of the above 
15 immunogenic portions and one or more additional immunogenic M. tuberculosis 
sequences, which are joined via a peptide linkage into a single amino acid chain. The 
sequences may be joined directly (i.e., with no intervening amino acids) or may be 
joined by way of a linker sequence (e.g., Gly-Cys-Gly) that does not significantly 
diminish the immunogenic properties of the component polypeptides. 
20 In general, M. tuberculosis antigens, and DNA sequences encoding such 

antigens, may be prepared using any of a variety of procedures. For example, soluble 
antigens may be isolated from M. tuberculosis culture filtrate by procedures known to 
those of ordinary skill in the art, including anion-exchange and reverse phase 
chromatography. Purified antigens are then evaluated for their ability to elicit an 
25 appropriate immune response (e.g., cellular) using, for example, the representative 
methods described herein. Immunogenic antigens may then be partially sequenced 
using techniques such as traditional Edman chemistry. See Edman and Berg, Eur. J. 
Biochem. 50:116-132, 1967. 

Immunogenic antigens may also be produced recombinantly using a 
30 DNA sequence that encodes the antigen, which has been inserted into an expression 
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vector and expressed in an appropriate host. DNA molecules encoding soluble antigens 
may be isolated by screening an appropriate M. tuberculosis expression library with 
anti-sera (?.g„ rabbit) raised specifically against soluble M. tuberculosis antigens. DNA 
sequences encoding antigens that may or may not be soluble may be identified by 
5 screening an appropriate M. tuberculosis genomic or cDNA expression library with sera 
obtained from patients infected with M. tuberculosis. Such screens may generally be 
performed using techniques well known to those of ordinary skill in the art, such as 
those described in Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold 
Spring Harbor Laboratories, Cold Spring Harbor, NY, 1989. 
10 DNA sequences encoding soluble antigens may also be obtained by 

screening an appropriate M. tuberculosis cDNA or genomic DNA library for DNA 
sequences that hybridize to degenerate oligonucleotides derived from partial amino acid 
sequences of isolated soluble antigens. Degenerate oligonucleotide sequences for use in 
such a screen may be designed and synthesized, and the screen may be performed, as 
1 5 described (for example) in Sambrook et al., Molecular Cloning: A Laboratory Manual, 
Cold Spring Harbor Laboratories, Cold Spring Harbor, NY, 1989 (and references cited 
therein). Polymerase chain reaction (PCR) may also be employed, using the above 
oligonucleotides in methods well known in the art, to isolate a nucleic acid probe from a 
cDNA or genomic library. The library screen may then be performed using the isolated 
20 probe. 

Alternatively, genomic or cDNA libraries derived from M. tuberculosis 
may be screened directly using peripheral blood mononuclear cells (PBMCs) or T cell 
lines or clones derived from one or more M. tuberculosis-immune individuals. In 
general, PBMCs and/or T cells for use in such screens may be prepared as described 
25 below. Direct library screens may generally be performed by assaying pools of 
expressed recombinant proteins for the ability to induce proliferation and/or interferon-y 
production in T cells derived from an M. tuberculosis-immune individual. 
Alternatively, potential T cell antigens may be first selected based on antibody 
reactivity, as described above. 
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Regardless of the method of preparation, the antigens (and immunogenic 
portions thereof) described herein (which may or may not be soluble) have the ability to 
induce an immunogenic response. More specifically, the antigens have the ability to 
induce proliferation and/or cytokine production (/.£?., interferon-? and/or interleukin-12 
5 production) in T cells, NK cells, B cells and/or macrophages derived from an 
M tuberculosis-immune individual. The selection of cell type for use in evaluating an 
immunogenic response to a antigen will, of course, depend on the desired response. For 
example, interleukin-12 production is most readily evaluated using preparations 
containing B cells and/or macrophages. An M. tuberculosis-immune individual is one 
10 who is considered to be resistant to the development of tuberculosis by virtue of having 
mounted an effective T cell response to M. tuberculosis (/.«., substantially free of 
disease symptoms). Such individuals may be identified based on a strongly positive 
(/.e., greater than about 10 mm diameter induration) intradermal skin test response to 
tuberculosis proteins (PPD) and an absence of any signs or symptoms of tuberculosis 
15 disease. T cells, NK cells, B cells and macrophages derived from M tuberculosis- 
immune individuals may be prepared using methods known to those of ordinary skill in 
the art. For example, a preparation of PBMCs (z.e., peripheral blood mononuclear cells) 
may be employed without further separation of component cells. PBMCs may 
generally be prepared, for example, using density centrifiigation through Ficoll™ 
20 (Winthrop Laboratories, NY). T cells for use in the assays described herein may also be 
purified directly from PBMCs. Alternatively, an enriched T cell line reactive against 
mycobacterial proteins, or T cell clones reactive to individual mycobacterial proteins, 
may be employed. Such T cell clones may be generated by, for example, culturing 
PBMCs from M tuberculosis-immune individuals with mycobacterial proteins for a 
25 period of 2-4 weeks. This allows expansion of only the mycobacterial protein-specific 
T cells, resulting in a line composed solely of such cells. These cells may then be 
cloned and tested with individual proteins, using methods known to those of ordinary 
skill in the art, to more accurately define individual T cell specificity. In general, 
antigens that test positive in assays for proliferation and/or cytokine production (Le., 
30 interferon-y and/or interleukin-12 production) performed using T ceils, NK cells, B cells 
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and/or macrophages derived from an M tuberculosis-iinmune individual are considered 
immunogenic. Such assays may be performed, for example, using the representative 
procedures described below. Immunogenic portions of such antigens may be identified 
using similar assays, and may be present within the polypeptides described herein. 

The ability of a polypeptide (e.g., an immunogenic antigen, or a portion 
or other variant thereof) to induce cell proliferation is evaluated by contacting the cells 
(e.g., T cells and/or NK cells) with the polypeptide and measuring the proliferation of 
the cells. In general, the amount of polypeptide that is sufficient for evaluation of about 
10 5 cells ranges from about lOng/mL to about lOOfig/mL and preferably is about 
10 jig/mL. The incubation of polypeptide with cells is typically performed at 37°C for 
about six days. Following incubation with polypeptide, the cells are assayed for a 
proliferative response, which may be evaluated by methods known to those of ordinary 
skill in the art, such as exposing cells to a pulse of radiolabeled thymidine and 
measuring the incorporation of label into cellular DNA. In general, a polypeptide that 
results in at least a three fold increase in proliferation above background (/.e., the 
proliferation observed for cells cultured without polypeptide) is considered to be able to 
induce proliferation. 

The ability of a polypeptide to stimulate the production of interferon-y 
and/or interleukin-12 in cells may be evaluated by contacting the cells with the 
polypeptide and measuring the level of interferon-y or interleukin-12 produced by the 
cells. In general, the amount of polypeptide that is sufficient for the evaluation of about 
10 s cells ranges from about lOng/mL to about lOO^ig/mL and preferably is about 
10 yg/mL. The polypeptide may, but need not, be immobilized on a solid support such 
as a bead or a biodegradable microsphere, such as those described in U.S. Patent 
Nos. 4,897,268 and 5,075,109. The incubation of polypeptide with the cells is typically 
performed at 37°C for about six days. Following incubation with polypeptide, the cells 
are assayed for interferon-y and/or interleukin-12 (or one or more subunits thereof), 
which may be evaluated by methods known to those of ordinary skill in the art, such as 
an enzyme-linked immunosorbent assay (ELISA) or, in the case of IL-12 P70 subunit, a 
bioassay such as an assay measuring proliferation of T cells. In general, a polypeptide 
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that results in the production of at least 50 pg of interferon-y per mL of cultured 
supernatant (containing 10 4 -10 5 T ceils per mL) is considered able to stimulate the 
production of interferon-y. A polypeptide that stimulates the production of at least 
10 pg/mL of IL-12 P70 subunit, and/or at least 100 pg/mL of IL-12 P40 subunit, per 10 s 
5 macrophages or B cells (or per 3 x 10 5 PBMC) is considered able to stimulate the 
production of IL-12. 

In general, immunogenic antigens are those antigens that stimulate 
proliferation and/or cytokine production {i.e., interferon-y and/or interleukin-12 
production) in T cells, NK cells, B cells and/or macrophages derived from at least about 
10 25% of M. tuberculosis-immune individuals. Among these immunogenic antigens, 
polypeptides having superior therapeutic properties may be distinguished based on the 
magnitude of the responses in the above assays and based on the percentage of 
individuals for which a response is observed. In addition, antigens having superior 
therapeutic properties will not stimulate proliferation and/or cytokine production in 
15 vitro in cells derived from more than about 25% of individuals that are not 
M. tuberculosis-immune, thereby eliminating responses that are not specifically due to 
M. tuberculosis-responsive cells. Those antigens that induce a response in a high 
percentage of T cell, NK cell, B cell and/or macrophage preparations from 
M. tuberculosis-immune individuals (with a low incidence of responses in cell 
20 preparations from other individuals) have superior therapeutic properties. 

Antigens with superior therapeutic properties may also be identified 
based on their ability to diminish the severity of M. tuberculosis infection in 
experimental animals, when administered as a vaccine. Suitable vaccine preparations 
for use on experimental animals are described in detail below. Efficacy may be 
25 determined based on the ability of the antigen to provide at least about a 50% reduction 
in bacterial numbers and/or at least about a 40% decrease in mortality following 
experimental infection. Suitable experimental animals include mice, guinea pigs and 
primates. 

Antigens having superior diagnostic properties may generally be 
30 identified based on the ability to elicit a response in an intradermal skin test performed 
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on an individual with active tuberculosis, but not in a test performed on an individual 
who is not infected with M. tuberculosis. Skin tests may generally be performed as 
described below, with a response of at least 5 mm induration considered positive. 

Immunogenic portions of the antigens described herein may be prepared 
5 and identified using well known techniques, such as those summarized in Paul, 
Fundamental Immunology, 3d ed., Raven Press, 1993, pp. 243-247 and references cited 
therein. Such techniques include screening polypeptide portions of the native antigen 
for immunogenic properties. The representative proliferation and cytokine production 
assays described herein may generally be employed in these screens. An immunogenic 
10 portion of a polypeptide is a portion that, within such representative assays, generates 
an immune response {e.g., proliferation, interferon-y production and/or interleukin-12 
production) that is substantially similar to that generated by the full length antigen. In 
other words, an immunogenic portion of an antigen may generate at least about 20%, 
and preferably about 100%, of the proliferation induced by the full length antigen in the 
15 model proliferation assay described herein. An immunogenic portion may also, or 
alternatively, stimulate the production of at least about 20%, and preferably about 
100%, of the interferon-y and/or interleukin-12 induced by the full length antigen in the 
model assay described herein. 

Portions and other variants of M. tuberculosis antigens may be generated 
20 by synthetic or recombinant means. Synthetic polypeptides having fewer than about 
100 amino acids, and generally fewer than about 50 amino acids, may be generated 
using techniques well known to those of ordinary skill in the art For example, such 
polypeptides may be synthesized using any of the commercially available solid-phase 
techniques, such as the Merrifield solid-phase synthesis method, where amino acids are 
25 sequentially added to a growing amino acid chain. See Merrifield, J. Am. Chem. Soc. 
55:2149-2146, 1963. Equipment for automated synthesis of polypeptides is 
commercially available from suppliers such as Applied BioSystems, Inc., Foster City, 
CA, and may be operated according to the manufacturer's instructions. Variants of a 
native antigen may generally be prepared using standard mutagenesis techniques, such 
30 as oligonucleotide-directed site-specific mutagenesis. Sections of the DNA sequence 
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may also be removed using standard techniques to permit preparation of truncated 
polypeptides. 

Recombinant polypeptides containing portions and/or variants of a 
native antigen may be readily prepared from a DNA sequence encoding the polypeptide 
5 using a variety of techniques well known to those of ordinary skill in the art. For 
example, supernatants from suitable host/vector systems which secrete recombinant 
protein into culture media may be first concentrated using a commercially available 
filter. Following concentration, the concentrate may be applied to a suitable 
purification matrix such as an affinity matrix or an ion exchange resin. Finally, one or 
10 more reverse phase HPLC steps can be employed to further purify a recombinant 
protein. 

Any of a variety of expression vectors known to those of ordinary skill in 
the art may be employed to express recombinant polypeptides of this invention. 
Expression may be achieved in any appropriate host cell that has been transformed or 

15 transfected with an expression vector containing a DNA molecule that encodes a 
recombinant polypeptide. Suitable host cells include prokaryotes, yeast and higher 
eukaryotic cells. Preferably, the host cells employed are E. coli, yeast or a mammalian 
cell line such as COS or CHO. The DNA sequences expressed in this manner may 
encode naturally occurring antigens, portions of naturally occurring antigens, or other 

20 variants thereof. 

In general, regardless of the method of preparation, the polypeptides 
disclosed herein are prepared in substantially pure form. Preferably, the polypeptides 
are at least about 80% pure, more preferably at least about 90% pure and most 
preferably at least about 99% pure. In certain preferred embodiments, described in 

25 detail below, the substantially pure polypeptides are incorporated into pharmaceutical 
compositions or vaccines for use in one or more of the methods disclosed herein. 

In certain specific embodiments, the subject invention discloses 
polypeptides comprising at least an immunogenic portion of a soluble M tuberculosis 
antigen having one of the following N-terminal sequences, or a variant thereof that 

30 differs only in conservative substitutions and/or modifications: 
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(a) Asp-Pro- Val- Asp-Ala- Val-Ile-Asn-Thr-Thr-Cys-Asn-Tyr-Gly- 
Gln-Val-Val-Ala-Ala-Leu; (SEQ ID No. 120) 

(b) Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Tlir-Pro-Ala-Pro- 
Ser; (SEQ ID No. 121) 

5 (c) Ala-Ala-Met-Lys-Pro-Arg-Thr-Gly-Asp-GIy-Pro-Leu-Glu-AIa- 

Ala-Lys-Glu-Gly-Arg; (SEQ ID No. 122) 

(d) Tyr-Tyr-Trp-Cys-Pro-Gly-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly- 
Pro; (SEQ ID No. 123) 

(e) Asp-Ile-GIy-Ser-Glu-Ser-Thr-Glu-Asp-Gln-GIn-Xaa-Ala-Val; 
10 (SEQ ID No. 124) 

(f) Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro; (SEQ ID 
No. 125) 

(g) Asp-Pro-Glu-Pro-Ala-Pro-Pro-Val-Pro-Thr-Ala-Ala-Ala-Ser- 
Pro-Pro-Ser, (SEQ ID No. 126) 

15 00 Ala-Pro-Lys-Thr-Tyr-Xaa-Glu-Glu-Leu-Lys-Gly-Thr-Asp-Tbx- 

Gly; (SEQ ID No. 127) 

(i) Asp-Pro-Ala-Ser-Ala-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Leu- 

Thr-Ser-Leu-Leu-Asn-Ser-Leu-Ala-Asp-Pro-Asn-Val-Ser-Phe- 
Ala-Asn; (SEQ ID No. 128) 

20 0) Xaa-Asp-Ser-Glu-Lys-Ser-Ala-Thr-Ile-Lys-Val-Thr-Asp-Ala- 

Ser,(SEQIDNo. 134) 

(k) Ala-Gly-Asp-Thr-Xaa-Ile-Tyr-Ile-Val-Gly-Asn-Leu-Thr-Ala- 
Asp; (SEQ ID No. 135) or 

(1) Ala-Pro-Glu-Ser-Gly-Ala-Gly-Leu-GIy-Gly-Thr-Val-Gln-Ala- 

25 Gly; (SEQ ID No. 136) 

wherein Xaa may be any amino acid, preferably a cysteine residue. A DNA sequence 
encoding the antigen identified as (g) above is provided in SEQ ID No. 52, and the 
polypeptide encoded by SEQ ID No. 52 is provided in SEQ ID No. 53. A DNA 
sequence encoding the antigen defined as (a) above is provided in SEQ ID No. 101; its 

30 deduced amino acid sequence is provided in SEQ ID No. 102. A DNA sequence 
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corresponding to antigen (d) above is provided in SEQ ID No. 24 a DNA sequence 
corresponding to antigen (c) is provided in SEQ ID No. 25 and a DNA sequence 
corresponding to antigen (i) is provided in SEQ ID No. 99; its deduced amino acid 
sequence is provided in SEQ ID No. 100. 

In a further specific embodiment, the subject invention discloses 
polypeptides comprising at least an immunogenic portion of an M tuberculosis antigen 
having one of the following N-tenninal sequences, or a variant thereof that differs only 
in conservative substitutions and/or modifications: 

(m) Xaa-Tyr-Ile-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-IIe-Val-Pro-Gly-Lys- 
Ile-Asn-Val-His-Leu-Val; (SEQ ID No 137) or 

(n) Asp-Pro-Pro-Asp-Pro-His-Gln-Xaa-Asp-Met-Thr-Lys-Gly-Tyr- 
Tyr-Pro-Gly-Gly-Arg-Arg-Xaa-Phe; (SEQ ID No. 129) 
wherein Xaa may be any amino acid, preferably a cysteine residue. 

In other specific embodiments, the subject invention discloses 
polypeptides comprising at least an immunogenic portion of a soluble M. tuberculosis 
antigen (or a variant of such an antigen) that comprises one or more of the amino acid 
sequences encoded by (a) the DNA sequences of SEQ ID Nos.: 1, 2, 4-10, 13-25 and 
52; (b) the complements of such DNA sequences, or (c) DNA sequences substantially 
homologous to a sequence in (a) or (b). 

In further specific embodiments, the subject invention discloses 
polypeptides comprising at least an immunogenic portion of a M tuberculosis antigen 
(or a variant of such an antigen), which may or may not be soluble, that comprises one 
or more of the amino acid sequences encoded by (a) the DNA sequences of SEQ ID 
Nos.: 26-51, (b)the complements of such DNA sequences or (c) DNA sequences 
substantially homologous to a sequence in (a) or (b). 

In the specific embodiments discussed above, the M. tuberculosis 
antigens include variants that are encoded by DNA sequences which are substantially 
homologous to one or more of DNA sequences specifically recited herein. "Substantial 
homology," as used herein, refers to DNA sequences that are capable of hybridizing 
under moderately stringent conditions. Suitable moderately stringent conditions include 
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prewashing in a solution of 5X SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); hybridizing 
at 50°C-65°C, 5X SSC, overnight or, in the case of cross-species homology at 45°C, 
0.5X SSC; followed by washing twice at 65°C for 20 minutes with each of 2X, 0.5X 
and 0.2X SSC containing 0.1% SDS). Such hybridizing DNA sequences are also 
5 within the scope of this invention, as are nucleotide sequences that, due to code 
degeneracy, encode an immunogenic polypeptide that is encoded by a hybridizing DNA 
sequence. 

In a related aspect, the present invention provides fusion proteins 
comprising a first and a second inventive polypeptide or, alternatively, a polypeptide of 
10 the present invention and a known M tuberculosis antigen, such as the 38 kD antigen 
described above or ESAT-6 (SEQ ID Nos. 103 and 104), together with variants of such 
fusion proteins. The fusion proteins of the present invention may also include a linker 
peptide between the first and second polypeptides. 

A DNA sequence encoding a fusion protein of the present invention is 
15 constructed using known recombinant DNA techniques to assemble separate DNA 
sequences encoding the first and second polypeptides into an appropriate expression 
vector. The 3' end of a DNA sequence encoding the first polypeptide is ligated, with or 
without a peptide linker, to the 5' end of a DNA sequence encoding the second 
polypeptide so that the reading frames of the sequences are in phase to permit mRNA 
20 translation of the two DNA sequences into a single fusion protein that retains the 
biological activity of both the first and the second polypeptides. 

A peptide linker sequence may be employed to separate the first and the 
second polypeptides by a distance sufficient to ensure that each polypeptide folds into 
its secondary and tertiary structures. Such a peptide linker sequence is incorporated into 
25 the fusion protein using standard techniques well known in the art. Suitable peptide 
linker sequences may be chosen based on the following factors: (1) their ability to 
adopt a flexible extended conformation; (2) their inability to adopt a secondary structure 
that could interact with functional epitopes on the first and second polypeptides; and 
(3) the lack of hydrophobic or charged residues that might react with the polypeptide 
30 functional epitopes. Preferred peptide linker sequences contain Gly, Asn and Ser 
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residues. Other near neutral amino acids, such as Thr and Ala may also be used in the 
linker sequence. Amino acid sequences which may be usefully employed as linkers 
include those disclosed in Maratea etal., Gene 40:39-46, 1985; Murphy etal., Proc. 
Natl. Acad Set USA «:S258-8262, 1986; U.S. Patent No. 4,935,233 and U.S. Patent 
5 No. 4,751,180. The linker sequence may be from 1 to about 50 amino acids in length. 
Peptide sequences are not required when the first and second polypeptides have non- 
essential N-terminal amino acid regions that can be used to separate the functional 
domains and prevent steric interference. 

The ligated DNA sequences are operably linked to suitable 
10 transcriptional or translational regulatory elements. The regulatory elements 
responsible for expression of DNA are located only 5' to the DNA sequence encoding 
the first polypeptides. Similarly, stop codons require to end translation and 
transcription termination signals are only present 3' to the DNA sequence encoding the 
second polypeptide. 

15 In another aspect, the present invention provides methods for using one 

or more of the above polypeptides or fusion proteins (or DNA molecules encoding such 
polypeptides) to induce protective immunity against tuberculosis in a patient As used 
herein, a "patient" refers to any warm-blooded animal, preferably a human. A patient 
may be afflicted with a disease, or may be free of detectable disease and/or infection. In 

20 other words, protective immunity may be induced to prevent or treat tuberculosis. 

In this aspect, the polypeptide, fusion protein or DNA molecule is 
generally present within a pharmaceutical composition and/or a vaccine. 
Pharmaceutical compositions may comprise one or more polypeptides, each of which 
may contain one or more of the above sequences (or variants thereof), and a 

25 physiologically acceptable carrier. Vaccines may comprise one or more of the above 
polypeptides and a non-specific immune response enhancer, such as an adjuvant or a 
liposome (into which the polypeptide is incorporated). Such pharmaceutical 
compositions and vaccines may also contain other M. tuberculosis antigens, either 
incorporated into a combination polypeptide or present within a separate polypeptide. 
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Alternatively, a vaccine may contain DNA encoding one or more 
polypeptides as described above, such that the polypeptide is generated in situ. In such 
vaccines,_the DNA may be present within any of a variety of delivery systems known to 
those of ordinary skill in the art, including nucleic acid expression systems, bacterial 
5 and viral expression systems. Appropriate nucleic acid expression systems contain the 
necessary DNA sequences for expression in the patient (such as a suitable promoter and 
terminating signal). Bacterial delivery systems involve the administration of a 
bacterium (such as Bacillus-Calmette-Guerrin) that expresses an immunogenic portion 
of the polypeptide on its cell surface. In a preferred embodiment, the DNA may be 
1 0 introduced using a viral expression system {e.g., vaccinia or other pox virus, retrovirus, 
or adenovirus), which may involve the use of a non-pathogenic (defective), replication 
competent virus. Techniques for incorporating DNA into such expression systems are 
well known to those of ordinary skill in the art. The DNA may also be "naked," as 
described, for example, in Ulmer et al., Science 259: 1745-1749, 1993 and reviewed by 
15 Cohen, Science 25P:1691-1692, 1993. The uptake of naked DNA may be increased by 
coating the DNA onto biodegradable beads, which are efficiently transported into the 
cells. 

In a related aspect, a DNA vaccine as described above may be 
acbainistered simultaneously with or sequentially to either a polypeptide of the present 
20 invention or a known M. tuberculosis antigen, such as the 38 kD antigen described 
above. For example, administration of DNA encoding a polypeptide of the present 
invention, either "naked" or in a delivery system as described above, may be followed 
by administration of an antigen in order to enhance the protective immune effect of the 
vaccine. 

25 Routes and frequency of administration, as well as dosage, will vary 

from individual to individual and may parallel those currently being used in 
immunization using BCG. In general, the pharmaceutical compositions and vaccines 
may be administered by injection (e.g., intracutaneous, intramuscular, intravenous or 
subcutaneous), intranasally (e.g., by aspiration) or orally. Between 1 and 3 doses may 

30 be adnunistered for a 1-36 week period. Preferably, 3 doses are administered, at 
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intervals of 3-4 months, and booster vaccinations may be given periodically thereafter. 
Alternate protocols may be appropriate for individual patients. A suitable dose is an 
amount of polypeptide or DNA that, when adrninistered as described above, is capable 
of raising an immune response in an immunized patient sufficient to protect the patient 
5 from M. tuberculosis infection for at least 1-2 years. In general, the amount of 
polypeptide present in a dose (or produced in situ by the DNA in a dose) ranges from 
about 1 pg to about 100 mg per kg of host, typically from about 10 pg to about 1 mg, 
and preferably from about 100 pg to about 1 ug. Suitable dose sizes will vary with the 
size of the patient, but will typically range from about 0. 1 mL to about 5 mL. 
1 0 While ^ suitable carrier known to those of ordinary skill in the art may 

be employed in the pharmaceutical compositions of this invention, the type of carrier 
will vary depending on the mode of administration. For parenteral administration, such 
as subcutaneous injection, the carrier preferably comprises water, saline, alcohol, a fat, a 
wax or a buffer. For oral acirninistration, any of the above carriers or a solid carrier, 
15 such as mannitoL lactose, starch, magnesium stearate, sodium saccharine, talcum, 
cellulose, glucose, sucrose, and magnesium carbonate, may be employed. 
Biodegradable microspheres (e.g., polylactic galactide) may also be employed as 
carriers for the pharmaceutical compositions of this invention. Suitable biodegradable 
microspheres are disclosed, for example, in U.S. Patent Nos. 4,897,268 and 5,075,109. 
20 of a variety of adjuvants may be employed in the vaccines of this 

invention to nonspecifically enhance the immune response. Most adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum 
hydroxide or mineral oil, and a nonspecific stimulator of immune responses, such as 
lipid A, Bortadella pertussis or Mycobacterium tuberculosis. Suitable adjuvants are 
25 commercially available as, for example, Freund's Incomplete Adjuvant and Freund's 
Complete Adjuvant (Difco Laboratories) and Merck Adjuvant 65 (Merck and 
Company, Inc., Rahway, NJ). Other suitable adjuvants include alum, biodegradable 
microspheres, monophosphoryl lipid A and quil A. 

In another aspect, this invention provides methods for using one or more 
30 of the polypeptides described above to diagnose tuberculosis using a skin test. As used 
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herein, a "skin test" is any assay performed directly on a patient in which a delayed-type 
hypersensitivity (DTH) reaction (such as swelling, reddening or dermatitis) is measured 
following^ intradermal injection of one or more polypeptides as described above. Such 
injection may be achieved using any suitable device sufficient to contact the 
5 polypeptide or polypeptides with dermal cells of the patient, such as a tuberculin 
syringe or 1 mL syringe. Preferably, the reaction is measured at least 48 hours after 
injection, more preferably 48-72 hours. 

The DTH reaction is a cell-mediated immune response, which is greater 
in patients that have been exposed previously to the test antigen (/.*., the immunogenic 
10 portion of the polypeptide employed, or a variant thereof). The response may be 
measured visually, using a ruler. In general, a response that is greater than about 0.5 cm 
in diameter, preferably greater than about 1.0 cm in diameter, is a positive response, 
indicative of tuberculosis infection, which may or may not be manifested as an active 
disease. 

15 The polypeptides of this invention are preferably formulated, for use in a 

skin test, as pharmaceutical compositions containing a polypeptide and a 
physiologically acceptable carrier, as described above. Such compositions typically 
contain one or more of the above polypeptides in an amount ranging fiom about 1 ug to 
about 100 ug, preferably from about 10 ug to about 50 ug in a volume of 0.1 mL. 
20 Preferably, the carrier employed in such pharmaceutical compositions is a saline 
solution with appropriate preservatives, such as phenol and/or Tween 80™. 

In a preferred embodiment, a polypeptide employed in a skin test is of 
sufficient size such that it remains at the site of injection for the duration of the reaction 
period. In general, a polypeptide that is at least 9 amino acids in length is sufficient. 
25 The polypeptide is also preferably broken down by macrophages within hours of 
injection to allow presentation to T-cells. Such polypeptides may contain repeats of one 
or more of the above sequences and/or other immunogenic or nonimmunogenic 
sequences. 
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The following Examples are offered by way of illustration and not by 
way of limitation. 

EXAMPLES 

5 

EXAMPLE 1 

Purification and Characterization of Polypeptides 

FROM M. TUBERCULOSIS CULTURE FILTRATE 

10 This example illustrates the preparation of M. tuberculosis soluble 

polypeptides from culture filtrate. Unless otherwise noted, all percentages in the 
following example are weight per volume. 

M tuberculosis (either H37Ra, ATCC No. 25177, or H37Rv, ATCC 
No. 25618) was cultured in sterile GAS media at 37°C for fourteen days. The media 
15 was then vacuum filtered (leaving the bulk of the cells) through a 0.45 \x filter into a 
sterile 2.5 L bottle. The media was next filtered through a 0.2 ji filter into a sterile 4 L 
bottle and NaN 3 was added to the culture filtrate to a concentration of 0.04%. The 
bottles were then placed in a 4°C cold room. 

The culture filtrate was concentrated by placing the filtrate in a 12 L 
20 reservoir that had been autoclaved and feeding the filtrate into a 400 ml Amicon stir cell 
which had been rinsed with ethanol and contained a 10,000 kDa MWCO membrane. 
The pressure was maintained at 60 psi using nitrogen gas. This procedure reduced the 
12 L volume to approximately 50 ml. 

The culture filtrate was dialyzed into 0. 1% ammonium bicarbonate using 
25 a 8,000 kDa MWCO cellulose ester membrane, with two changes of ammonium 
bicarbonate solution. Protein concentration was then determined by a commercially 
available BCA assay (Pierce, Rockford, IL). 

The dialyzed culture filtrate was then lyophilized, and the polypeptides 
resuspended in distilled water. The polypeptides were dialyzed against 0.01 mM 1,3 
30 bis[tris(hydroxymethyl)-methylamino]propane, pH 7.5 (Bis-Tris propane buffer), the 
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initial conditions for anion exchange chromatography. Fractionation was performed 
using gel profusion chromatography on a POROS 146 II Q/M anion exchange column 
4.6 mm x 100 mm (Perseptive BioSystems, Framingham, MA) equilibrated in 0.01 mM 
Bis-Tris propane buffer pH 7.5. Polypeptides were eluted with a linear 0-0.5 M NaCl 
5 gradient in the above buffer system. The column eluent was monitored at a wavelength 
of 220 nm. 

The pools of polypeptides eluting from the ion exchange column were 
dialyzed against distilled water and lyophilized. The resulting material was dissolved in 
0.1% trifluoroacetic acid (TFA) pH 1.9 in water, and the polypeptides were purified on 

10 a Delta-Pak C18 column (Waters, Milford, MA) 300 Angstrom pore size, 5 micron 
particle size (3.9 x 150 mm). The polypeptides were eluted from the column with a 
linear gradient from 0-60% dilution buffer (0.1% TFA in acetonitrile). The flow rate 
was 0.75 ml/minute and the HPLC eluent was monitored at 214 nm. Fractions 
containing the eluted polypeptides were collected to maximize the purity of the 

15 individual samples. Approximately 200 purified polypeptides were obtained. 

The purified polypeptides were then screened for the ability to induce T- 
cell proliferation in PBMC preparations. The PBMCs from donors known to be PPD 
skin test positive and whose T-cells were shown to proliferate in response to PPD and 
crude soluble proteins from MTB were cultured in medium comprising RPMI 1640 

20 supplemented with 10% pooled human serum and 50^g/ml gentamicin. Purified 
polypeptides were added in duplicate at concentrations of 0.5 to 10 \xg/mL. After six 
days of culture in 96-well round-bottom plates in a volume of 200 jj.1, 50 pi of medium 
was removed from each well for determination of IFN-y levels, as described below. 
The plates were then pulsed with 1 p.Ci/well of tritiated thymidine for a further 18 

25 hours, harvested and tritium uptake determined using a gas scintillation counter. 
Fractions that resulted in proliferation in both replicates three fold greater than the 
proliferation observed in cells cultured in medium alone were considered positive. 

IFN-y was measured using an enzyme-linked immunosorbent assay 
(ELISA). ELISA plates were coated with a mouse monoclonal antibody directed to 

30 human IFN-y (PharMingen, San Diego, CA) in PBS for four hours at room temperature. 
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Wells were then blocked with PBS containing 5% (W/V) non-fat dried milk for 1 hour 
at room temperature. The plates were then washed six times in PBS/0.2% TWEEN-20 
and samples diluted 1:2 in culture medium in the ELISA plates were incubated 
overnight at room temperature. The plates were again washed and a polyclonal rabbit 
5 anti-human IFN-y serum diluted 1:3000 in PBS/10% normal goat serum was added to 
each well. The plates were then incubated for two hours at room temperature, washed 
and horseradish peroxidase-coupled anti-rabbit IgG (Sigma Chemical So., St Louis, 
MO) was added at a 1 :2000 dilution in PBS/5% non-fat dried milk. After a further two 
hour incubation at room temperature, the plates were washed and TMB substrate added, 

10 The reaction was stopped after 20 min with 1 N sulfuric acid. Optical density was 
determined at 450 nm using 570 nm as a reference wavelength. Fractions that resulted 
in both replicates giving an OD two fold greater than the mean OD from cells cultured 
in medium alone, plus 3 standard deviations, were considered positive. 

For sequencing, the polypeptides were individually dried onto 

15 Biobrene™ (Perkin Elmer/Applied BioSystems Division, Foster City, CA) treated glass 
fiber filters. The filters with polypeptide were loaded onto a Perkin Elmer/Applied 
BioSystems Division Precise 492 protein sequencer. The polypeptides were sequenced 
from the amino terminal and using traditional Edman chemistry. The amino acid 
sequence was determined for each polypeptide by comparing the retention time of the 
20 PTH amino acid derivative to the appropriate PTH derivative standards. 

Using the procedure described above, antigens having the following 
N-terminal sequences were isolated: 

(a) Asp-Prc^Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Xaa-Asn-Tyr-GIy- 
Gln- Val-Val-Ala-Ala-Leu; (SEQ ID No. 54) 
25 (b) Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Thr-Pro-Ala-Pro- 

Ser; (SEQIDNo. 55) 

(c) Ala-AIa-Met-Lys-Pro-Arg-Thr-Gly-Asp-Gly-Pro-Leu-Glu-Ala- 
Ala-Lys-Glu-Gly-Arg; (SEQ ID No. 56) 

(d) Tyr-Tyr-Trp-Cys-Pro-Gly-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly- 
30 Pro; (SEQ ID No. 57) 
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(e) Asp-Ile-Gly-Ser-GIu-Ser-Thr-Glu-Asp-Gln-Gln-Xaa-Ala-Val; 
(SEQ ID No. 58) 

(f) Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro; (SEQ ID 
No. 59) 

(g) Asp-Pro-Glu-Pro-Ala-Pro-Pro-Val-Pro-Thr-Ala-Ala-Ala-Ala- 
Pro-Pro-Ala; (SEQ ID No. 60) and 

(h) Ala-Pro-Lys-Thr-Tyr-Xaa-Glu-GIu-Leu-Lys-Gly-Thx-Asp-Thr- 
Gly; (SEQ ID No. 61) 

wherein Xaa may be any amino acid. 

An additional antigen was isolated employing a microbore HPLC 
purification step in addition to the procedure described above. Specifically, 20 jjI of a 
fraction comprising a mixture of antigens from the chromatographic purification step 
previously described, was purified on an Aquapore CI 8 column (Perkin Elmer/Applied 
Biosystems Division, Foster City, CA) with a 7 micron pore size, column size 1 mm x 
15 100 mm, in a Perkin Elmer/Applied Biosystems Division Model 172 HPLC. Fractions 
were eluted from the column with a linear gradient of 1%/minute of acetonitrile 
(containing 0.05% TFA) in water (0.05% TFA) at a flow rate of 80 ul/minute. The 
eluent was monitored at 250 nm. The original fraction was separated into 4 major peaks 
plus other smaller components and a polypeptide was obtained which was shown to 
have a molecular weight of 12.054 Kd (by mass spectrometry) and the following N- 
terminal sequence: 

(i) Asp-Pro-Ala-Ser-Aia-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Gln- 
Thr-Ser-Leu-Leu-Asn-Asn-Leu-Ala-Asp-Pro-Asp-Val-Ser-Phe- 
Ala-Asp (SEQ ID No. 62). 
This polypeptide was shown to induce proliferation and IFN-y production in PBMC 
preparations using the assays described above. 

Additional soluble antigens were isolated from M. tuberculosis culture 
filtrate as follows. M. tuberculosis culture filtrate was prepared as described above. 
Following dialysis against Bis-Tris propane buffer, at pH 5.5, fractionation was 
30 performed using anion exchange chromatography on a Poros QE column 4.6 x 100 mm 
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(Perseptive Biosystems) equilibrated in Bis-Tris propane buffer pH 5.5. Polypeptides 
were eluted with a linear 0-1.5 M NaCl gradient in the above buffer system at a flow 
rate of 10 ml/min. The column eluent was monitored at a wavelength of 214 nm. 

The fractions eluting from the ion exchange column were pooled and 
5 subjected to reverse phase chromatography using a Poros R2 column 4.6 x 100 mm 
(Perseptive Biosystems). Polypeptides were eluted from the column with a linear 
gradient from 0-100% acetonitrile (0.1% TFA) at a flow rate of 5 ml/min. The eluent 
was monitored at 2 1 4 nm. 

Fractions containing the eluted polypeptides were lyophilized and 
10 resuspended in 80 ^1 of aqueous 0.1% TFA and further subjected to reverse phase 
chromatography on a Vydac C4 column 4.6 x 150 mm (Western Analytical, Temecula, 
CA) with a linear gradient of 0-100% acetonitrile (0.1% TFA) at a flow rate of 2 
ml/min. Eluent was monitored at 214 nm. 

The fraction with biological activity was separated into one major peak 
15 plus other smaller components. Western blot of this peak onto PVDF membrane 
revealed three major bands of molecular weights 14 Kd, 20 Kd and 26 Kd. These 
polypeptides were determined to have the following N-terminal sequences, respectively: 

0) Xaa-Asp-Ser-Glu-Lys-Ser-Ala-Thr-IIe-Lys-Val-Thr-Asp-Ala- 
Ser;(SEQ ID No. 134) 

20 (k) Ala-Gly-Asp-Thr-Xaa-Ile-Tyr-Ile-Val-Gly-Asn-Leu-Thr-Ala- 

Asp; (SEQ ID No. 135) and 

(1) Ala-Pro-Glu-Ser-GIy-Ala-Gly-Leu-Gly-Gly-Thr-Val-Gln-Ala- 
Gly; (SEQ ID No. 136), wherein Xaa may be any amino acid. 
Using the assays described above, these polypeptides were shown to induce 
25 proliferation and IFN-y production in PBMC preparations. Figs. 1A and B show the 
results of such assays using PBMC preparations from a first and a second donor, 
respectively. 

DNA sequences that encode the antigens designated as (a), (c), (d) and 
(g) above were obtained by screening a genomic M. tuberculosis library using 32 P end 
30 labeled degenerate oligonucleotides corresponding to the N-terminal sequence and 
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containing M. tuberculosis codon bias. The screen performed using a probe 
corresponding to antigen (a) above identified a clone having the sequence provided in 
SEQ ID No. 101. The polypeptide encoded by SEQ ID No. 101 is provided in SEQ ID 
No. 102. The screen performed using a probe corresponding to antigen (g) above 
' identified a clone having the sequence provided in SEQ ID No. 52. The polypeptide 
encoded by SEQ ID No. 52 is provided in SEQ ID No. 53. The screen performed 
using a probe corresponding to antigen (d) above identified a clone having the sequence 
provided in SEQ ID No. 24, and the screen performed with a probe corresponding to 
antigen (c) identified a clone having the sequence provided in SEQ ID No: 25. 

The above amino acid sequences were compared to known amino acid 
sequences in the gene bank using the DNA STAR system. The database searched 
contains some 173,000 proteins and is a combination of the Swiss, PIR databases along 
with translated protein sequences (Version 87). No significant homologies to the arnino 
acid sequences for antigens (a)-(h) and (1) were detected. 

The amino acid sequence for antigen (i) was found to be homologous to 
a sequence from M. leprae. The full length M. leprae sequence was amplified from 
genomic DNA using the sequence obtained from GENBANK. This sequence was then 
used to screen the M. tuberculosis library described below in Example 2 and a full 
length copy of the M. tuberculosis homologue was obtained (SEQ ID No. 99). 

The amino acid sequence for antigen (j) was found to be homologous to 
a known M. tuberculosis protein translated from a DNA sequence. To the best of the 
inventors' knowledge, this protein has not been previously shown to possess T-cell 
stimulatory activity. The amino acid sequence for antigen (k) was found to be related to 
a sequence from M. leprae. 

In the proliferation and IFN-y assays described above, using three PPD 
positive donors, the results for representative antigens provided above are presented in 
Table 1: 
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TABLE 1 

Results of PBMC Proliferation and IFN-v Assays 



Sequence 


Proliferation 1 IFN-y 


(a) 


+ 




(c) 


+++ 


+++ 


(d) 


++ 


++ 


(g) 


+++ 


+++ 


(h) 


+++ 


+++ 



In Table 1, responses that gave a stimulation index (SI) of between 2 and 
4 (compared to cells cultured in medium alone) were scored as +, an SI of 4-8 or 2-4 at 
a concentration of 1 or less was scored as ++ and an SI of greater than 8 was scored 
as +++. The antigen of sequence (i) was found to have a high SI (+-H-) for one donor 
and lower SI (++ and +) for the two other donors in both proliferation and IFN-y assays. 
These results indicate that these antigens are capable of inducing proliferation and/or 
interferon-y production. 

EXAMPLE 2 

Use of Patient Sera to Isolate M. Tuberculosis Antigens 
15 

This example illustrates the isolation of antigens from M. tuberculosis 
lysate by screening with serum from M. tuberculosis-infected individuals. 

Dessicated M. tuberculosis H37Ra (Difco Laboratories) was added to a 
2% NP40 solution, and alternately homogenized and sonicated three times. The 
20 resulting suspension was centrifuged at 13,000 rpm in microfuge tubes and the 
supernatant put through a 0.2 micron syringe filter. The filtrate was bound to Macro 
Prep DEAE beads (BioRad, Hercules, CA). The beads were extensively washed with 
20 mM Tris pH 7.5 and bound proteins eluted with 1M NaCl. The 1M NaCl elute was 
dialyzed overnight against 10 mM Tris, pH 7.5. Dialyzed solution was treated with 



5 



10 



o o 

WO 97/09428 PCT/US96/14674 



5 



DNase and RNase at 0.05 mg/ml for 30 min. at room temperature and then with a-D- 
mannosidase, 0.5 U/mg at pH 4.5 for 3-4 hours at room temperature. After returning to 
pH 7.5,_the material was fractionated via FPLC over a Bio Scale-Q-20 column 
(BioRad). Fractions were combined into nine pools, concentrated in a Centriprep 10 
5 (Amicon, Beverley, MA) and then screened by Western blot for. serological activity 
using a serum pool from M. tuberculosis-infected patients which was not 
immunoreactive with other antigens of the present invention. 

The most reactive fraction was run in SDS-PAGE and transferred to 
PVDF. A band at approximately 85 Kd was cut out yielding the sequence: 

0 (m) Xaa-Tyr-Ile-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-rie-Val-Pro-Gly-Lys- 

Ile-Asn-Val-His-Leu-Val: (SEQ ID No. 137), wherein Xaa may 
be any amino acid. 
Comparison of this sequence with those in the gene bank as described 
above, revealed no significant homologies to known sequences. 



EXAMPLE 3 

Preparation of DNA Sfom knces Encoding M. tuberculosis Amnroc 

This example illustrates the preparation of DNA sequences encoding 
M. tuberculosis antigens by screening a M. tuberculosis expression library with sera 
obtained from patients infected with M. tuberculosis, or with anti-sera raised against 
soluble M. tuberculosis antigens. 



A - Preparation of M tu berculosis Soluble Antigens usrNG Rabrtt A*m . 

SERA " " 

Genomic DNA was isolated from the M. tuberculosis strain H37Ra. The 
DNA was randomly sheared and used to construct an expression library using the 
Lambda ZAP expression system (Stratagene, La Jolla, CA). Rabbit anti-sera was 
generated against secretory proteins of the M. tuberculosis strains H37Ra, H37Rv and 
Erdman by immunizing a rabbit with concentrated supernatant of the M. tuberculosis 
cultures. Specifically, the rabbit was first immunized subcutaneously with 200 ug of 
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protein antigen in a total volume of 2 ml containing 10 ug muramyl dipeptide 
(Calbiochem, La Jolla, CA) and 1 ml of incomplete Freund's adjuvant Four weeks later 
the rabbit was boosted subcutaneously with 100 ug antigen in incomplete Freund's 
adjuvant. Finally, the rabbit was immunized intravenously four weeks later with 50 ug 
5 protein antigen. The anti-sera were used to screen the expression library as described in 
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratories, Cold Spring Harbor, NY, 1989. Bacteriophage plaques expressing 
immunoreactive antigens were purified. Phagemid from the plaques was rescued and 
the nucleotide sequences of the M. tuberculosis clones deduced. 
10 Thirty two clones were purified. Of these, 25 represent sequences that 

have not been previously identified in human M. tuberculosis. Recombinant antigens 
were expressed and purified antigens used in the immunological analysis described in 
Example 1. Proteins were induced by IPTG and purified by gel elution, as described in 
Skeiky etal., J. Exp. Med 7*7:1527-1537, 1995. Representative sequences of DNA 
15 molecules identified in this screen are provided in SEQ ID Nos.: 1-25. The 
corresponding predicted amino acid sequences are shown in SEQ ID Nos. 63-87. 

On comparison of these sequences with known sequences in the gene 
bank using the databases described above, it was found that the clones referred to 
hereinafter as TbRA2A, TbRA16, TbRA18, and TbRA29 (SEQ ID Nos. 76, 68; 70, 75) 
20 show some homology to sequences previously identified in Mycobacterium leprae but 
not in M. tuberculosis. TbRAll, TbRA26, TbRA28 and TbDPEP (SEQ ID Nos.: 65, 
73, 74, 53) have been previously identified in M. tuberculosis. No significant 
homologies were found to TbRAl, TbRA3, TbRA4, TbRA9, TbRAlO, TbRA13, 
TbRA17, TbRal9, TbRA29, TbRA32, TbRA36 and the overlapping clones TbRA35 
25 and TbRA12 (SEQ ID Nos. 63, 77, 81, 82, 64, 67, 69, 71, 75, 78, 80, 79, 66). The 
clone TbRa24 is overlapping with clone TbRa29. 

The results of PBMC proliferation and interferon-y assays performed on 
representative recombinant antigens, and using T-cell preparations from several 
different M. tuberculosis-immune patients, are presented in Tables 2 and 3, 
30 respectively. 
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In Tables 2 and 3, responses that gave a stimulation index (SI) of 
between 1.2 and 2 (compared to cells cultured in medium alone) were scored as ±, a SI 
of 2-4 was scored as +, as SI of 4-8 or 2-4 at a concentration of 1 ug or less was scored 
as ++ and an SI of greater than 8 was scored as +++. In addition, the effect of 
5 concentration on proliferation and interferon-y production is shown for two of the above 
antigens in the attached Figure. For both proliferation and interferon-y production, 
TbRa3 was scored as ++ and TbRa9 as +. 

These results indicate that these soluble antigens can induce proliferation 
and/or interferon-y production in T-cells derived from an M. tuberculosis-immune 
1 0 individual. 



B - Use of Patient Sfra to Tp fntify DNA Skoi tf/mpes Encodinc. 

M. TUBERCt IT.OSIS ANTIGENS 

The genomic DNA library described above, and an additional H37Rv 
15 library, were screened using pools of sera obtained from patients with active 
tuberculosis. To prepare the H37Rv library, M. tuberculosis strain H37Rv genomic 
DNA was isolated, subjected to partial SauSA digestion and used to construct an 
expression library using the Lambda Zap expression system (Stratagene, La Jolla, Ca). 
Three different pools of sera, each containing sera obtained from three individuals with 
20 active pulmonary or pleural disease, were used in the expression screening. The pools 
were designated TbL, TbM and TbH, referring to relative reactivity with H37Ra lysate 
{i.e., TbL = low reactivity, TbM = medium reactivity and TbH = high reactivity) in both 
ELISA and immunoblot format. A fourth pool of sera from seven patients with active 
pulmonary tuberculosis was also employed. All of the sera lacked increased reactivity 
25 with the recombinant 38 kD M. tuberculosis H37Ra phosphate-binding protein. 

All pools were pre-adsorbed with E. coli lysate and used to screen the 
H37Ra and H37Rv expression libraries, as described in Sambrook et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratories, Cold Spring Harbor, 
NY, 1989. Bacteriophage plaques expressing immunoreactive antigens were purified. 
Phagemid from the plaques was rescued and the nucleotide sequences of the 
M. tuberculosis clones deduced. 
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Thirty two clones were purified. Of these, 3 1 represented sequences that 
had not been previously identified in human M tuberculosis. Representative sequences 
of the DNA molecules identified are provided in SEQ ID Nos.: 26-51 and 105 Of 
these, TbH-8 and TbH-8-2 (SEQ. ID NO. 105) are non-contiguous DNA sequences 
from the same clone, and TbH-4 (SEQ. ID NO. 43) and TbH-4-FWD (SEQ. ID NO. 44) 
are non-contiguous sequences from the same clone. Amino acid sequences for the 
antigens hereinafter identified as Tb38-1, TbH-4, TbH-8, TbH-9, and TbH-12 are 
shown in SEQ ID Nos.: 88-92. Comparison of these sequences with known sequences 
in the gene bank using the databases identified above revealed no significant 
homologies to TbH-4, TbH-8, TbH-9 and TbM-3, although weak homologies were 
found to TbH-9. TbH-12 was found to be homologous to a 34 kD antigenic protein 
previously identified in M. paratuberculosis (Acc. No. S28515). Tb38-1 was found to 
be located 34 base pairs upstream of the open reading frame for the antigen ESAT-6 
previously identified in M. bovis (Acc. No. U34848) and in M tuberculosis (Sorensen 
1 5 et aL, Infec. Immun. 63: 1 7 1 0- 1 71 7, 1 995). 

Probes derived from Tb38-1 and TbH-9, both isolated from an H37Ra 
library, were used to identify clones in an H37Rv library. Tb38-1 hybridized to 
Tb38-lF2,Tb38-lF3,Tb38-lF5andTb38-lF6(SEQ.IDNOS. 112, 113, 116, 118, and 
119). (SEQ ID NOS. 112 and 113 are non-contiguous sequences from clone Tb38- 
20 1 F2.) Two open reading frames were deduced in Tb38-IF2; one corresponds to Tb37FL 
(SEQ. ID. NO. 114), the second, a partial sequence, may be the homologue of Tb38-1 
and is called Tb38-IN (SEQ. ID NO. 115). The deduced amino acid sequence of Tb38- 
1F3 is presented in SEQ. ID. NO. 117. A TbH-9 probe identified three clones in the 
H37Rv library: TbH-9-FL (SEQ. ID NO. 106), which may be the homologue of TbH-9 
25 (R37Ra), TbH-9-1 (SEQ. ID NO. 108), and TbH-9-4 (SEQ. ID NO. 1 10), all of which 
are highly related sequences to TbH-9. The deduced amino acid sequences for these 
three clones are presented in SEQ ID NOS. 107, 109 and 1 1 1. 

The results of T-cell assays performed on.Tb38-l, ESAT-6 and other 
representative recombinant antigens are presented in Tables 4A, B and 5, respectively, 
30 below: 
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TABLE 4A 

Results of PBMC Prot oration to Rfpresentattvf ANTir.FMQ 



Antigen 












Donor 




1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


Tb38.1 


+++ 


+ 








++ 




+ 




++ 


-H+ 


ESAT-6 




+ 


+ 


+ 
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++ 
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+ 1 
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TABLE 4B 

Results of PBMC Intf.rferon-v Produ ction to Rfpresentattvf Ai*nr,ENs 
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TABLE 5 

Summary of T-celi. Responses to Representative Anttctrsk 





Proliferation 


Interferon-y 




Antigen 


patient 4 


patient 5 


patient 6 


patient 4 


patient 5 


patient 6 


total 
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+++ 






13 
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4 
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TbL23 
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++ 


++ 
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7.5 
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++ 
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-in- 


++ 
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- control 














0 



These results indicate that both the inventive M. tuberculosis antigens 
and ESAT-6 can induce proliferation and/or interferon-y production in T-cells derived 
from an M. tuberculosis-immune individual. To the best of the inventors' knowledge, 
ESAT-6 has not been previously shown to stimulate human immune responses 

A set of six overlapping peptides covering the amino acid sequence of 
the antigen Tb38-1 was constructed using the method described in Example 4. The 
sequences of these peptides, hereinafter referred to as pep 1-6, are provided in SEQ ID 
Nos. 93-98, respectively. The results of T-cell assays using these peptides are shown in 
Tables 6 and 7. These results confirm the existence, and help to localize T-cell epitopes 
within Tb38-1 capable of inducing proliferation and interferon-y production in T-cells 
derived from an M. tuberculosis immune individual. 
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EXAMPLE 4 

Purification and Charac terization of a Polypeptide from Tuberct jt tm Puriftfti 

Protein Derivative 

5 

An M tuberculosis polypeptide was isolated from tuberculin purified 
protein derivative (PPD) as follows. 

PPD was prepared as published with some modification (Seibert, F. et 
al., Tuberculin purified protein derivative. Preparation and analyses of a large quantity 
1 0 for standard. The American Review of Tuberculosis 44:9-25, 1 94 1 ). 

M tuberculosis Rv strain was grown for 6 weeks in synthetic medium in 
roller bottles at 37°C. Bottles containing the bacterial growth were then heated to 100° 
C in water vapor for 3 hours. Cultures were sterile filtered using a 0.22 jx filter and the 
liquid phase was concentrated 20 times using a 3 kD cut-off membrane. Proteins were 
15 precipitated once with 50% ammonium sulfate solution and eight times with 25% 
ammonium sulfate solution. The resulting proteins (PPD) were fractionated by reverse 
phase liquid chromatography (RP-HPLC) using a CI 8 column (7-8 x 300 mM; Waters, 
Milford, MA) in a Biocad HPLC system (Perseptive Biosystems, Framingham, MA). 
Fractions were eluted from the column with a linear gradient from 0-100% buffer (0.1% 
20 TFA in acetonitrile). The flow rate was 10 ml/minute and eluent was monitored at 214 
run and 280 nm. 

Six fractions were collected, dried, suspended in PBS and tested 
individually in M tuberculosis-infected guinea pigs for induction of delayed type 
hypersensitivity (DTH) reaction. One fraction was found to induce a strong DTH 

25 reaction and was subsequently fractionated further by RP-HPLC on a microbore Vydac 
CI 8 column (Cat. No. 218TP5115) in a Perkin Elmer/Applied Biosystems Division 
Model 172 HPLC. Fractions were eluted with a linear gradient from 5-100% buffer 
(0.05% TFA in acetonitrile) with a flow rate of 80 ^il/minute. Eluent was monitored at 
215 nm. Eight fractions were collected and tested for induction of DTH in A£ 

30 tuberculosis-infected guinea pigs. One fraction was found to induce strong DTH of 
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about 16 mm induration. The other fractions did not induce detectable DTH. The 
positive fraction was submitted to SDS-PAGE gel electrophoresis and found to contain 
a single protein band of approximately 12 kD molecular weight. 

This polypeptide, herein after referred to as DPPD, was sequenced from 
5 the amino terminal using a Perkin Elmer/Applied Biosystems Division Precise 492 
protein sequencer as described above and found to have the N-terminal sequence shown 
in SEQ ID No.: 129. Comparison of this sequence with known sequences in the gene 
bank as described above revealed no known homologies. Four cyanogen bromide 
fragments of DPPD were isolated and found to have the sequences shown in SEQ ID 
10 Nos.: 130-133. 

The ability of the antigen DPPD to stimulate human PBMC to proliferate 
and to produce IFN-y was assayed as described in Example 1. As shown in Table 8, 
DPPD was found to stimulate proliferation and elicit production of large quantities of 
IFN-y; more than that elicited by commercial PPD. 



15 



TABLE 8 

Results of Prolif eration and Interferon-y Assays to DPPD 



PBMC Donor j Stimulator 


Proliferation (CPM) 


IFN-y (OD 4J0 ) 


A 


Medium 


1,089 


0.17 




PPD (commercial) 


8,394 


1.29 




DPPD 


13,451 


2.21 










B 


Medium 


450 


0.09 




PPD (commercial) 


3,929 


1.26 




DPPD 


6,184 


1.49 










C 


Medium 


541 


0.11 




PPD (commercial) 


8,907 


0.76 




DPPD 


23,024 


>2.70 
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EXAMPLE 5 
Synthesis of Svnthfttc Polypfpttdf*; 



Polypeptides may be synthesized on a Millipore 9050 peptide 
synthesizer using FMOC chemistry with HPTU (0-Benzotriazole-N,N,N\N'- 
tetramethyluronium hexafluorophosphate) activation. A Gly-Cys-Gly sequence may be 
attached to the amino terminus of the peptide to provide a method of conjugation or 
10 labeling of the peptide. Cleavage of the peptides from the solid support may be carried 
out using the following cleavage mixture: trifluoroacetic 
acid:ethanedithiol:thioanisole:water:phenol (40:1:2:2:3). After cleaving for 2 hours, the 
peptides may be precipitated in cold methyl-t-butyl-ether. The peptide pellets may then 
be dissolved in water containing 0.1% trifluoroacetic acid (TFA) and lyophilized prior 
15 to purification by C18 reverse phase HPLC. A gradient of 0%-60% acetonitrile 
(containing 0.1% TFA) in water (containing 0.1% TFA) may be used to elute the 
peptides. Following lyophilization of the pure fractions, the peptides may be 
characterized using electrospray mass spectrometry and by amino acid analysis. 



From the foregoing, it will be appreciated that, although specific 
embodiments of the invention have been described herein for the purpose of illustration, 
various modifications may be made without deviating from the spirit and scope of the 
invention. 



WO 97/09428 



PCT/US96/14674 



46 



SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANTS: Corixa Corporation 

(ii) TITLE OF INVENTION: COMPOUNDS AND METHODS FOR IMMUNOTHERAPY 
AND DIAGNOSIS OF TUBERCULOSIS 

(iii) NUMBER OF SEQUENCES: 137 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: SEED and BERRY LLP 

(B) STREET: 6300 Columbia Center. 701 Fifth Avenue 

(C) CITY: Seattle 

(D) STATE: Washington 

(E) COUNTRY: USA 

(F) ZIP: 98104-7092 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patent In Release #1.0. Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 27-AUG-1996 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Maki . David J. 

(B) REGISTRATION NUMBER: 31.392 

(C) REFERENCE/DOCKET NUMBER: 210121. 411PC 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (206) 622-4900 

(B) TELEFAX: (206) 682-6031 



WO 97/09428 PCI7US96/14674 

83 

CGACGTTTAA CAAGCCCAGC GCCTATTCGA CCGGTTGGGC ATTGTGGGTT GTGTTGGCTT 480 

TCATCGTGTT CCAGGCGGTT GCGGCAGTCC TGGCGCTCTT GGTGGAGACC GGCGCTATCA 540 

CCGCGCCGGC GCCGCGGCCC AAGTTCGACC CGTATGGACA GTACGGGCGG TACGGGCAGT 600 

ACGGGCAGTA CGGGGTGCAG CCGGGTGGGT ACTACGGTCA GCAGGGTGCT CAGCAGGCCG 660 

CGGGACTGCA GTCGCCCGGC CCGCAGCAGT CTCCGCAGCC TCCCGGATAT GGGTCGCAGT 720 

ACGGCGGCTA TTCGTCCAGT CCGAGCCAAT CGGGCAGTGG ATACACTGCT CAGCCCCCGG 780 

CCCAGCCGCC GGCGCAGTCC GGGTCGCAAC AATCGCACCA GGGCCCATCC ACGCCACCTA 840 

CCGGCTTTCC GAGCTTCAGC CCACCACCAC CGGTCAGTGC CGGGACGGGG TCGCAGGCTG 900 

GTTCGGCTCC AGTCAACTAT TCAAACCCCA GCGGGGGCGA GCAGTCGTCG TCCCCCGGGG 960 

GGGCGCCGGT CTAACCGGGC GTTCCCGCGT CCGGTCGCGC GTGTGCGCGA AGAGTGAACA 1020 

GGGTGTCAGC AAGCGCGGAC GATCCTCGTG CCGAATTC 1058 
(2) INFORMATION FOR SEQ ID N0:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:46: 

CGGCACGAGA GACCGATGCC GCTACCCTCG CGCAGGAGGC AGGTAATTTC GAGCGGATCT 60 

CCGGCGACCT GAAAACCCAG ATCGACCAGG TGGAGTCGAC GGCAGGTTCG TTGCAGGGCC 120 

AGTGGCGCGG CGCGGCGGGG ACGGCCGCCC AGGCCGCGGT GGTGCGCTTC CAAGAAGCAG 180 

CCAATAAGCA GAAGCAGGAA CTCGACGAGA TCTCGACGAA TATTCGTCAG GCCGGCGTCC 240 

AATACTCGAG GGCCGACGAG GAGCAGCAGC AGGCGCTGTC CTCGCAAATG GGCTTCTGAC 300 
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CCGCTAATAC GAAAAGAAAC GGAGCAA 
(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:47: 
CGGTCGCGAT GATGGCGTTG TCGAACGTGA CCGATTCTGT ACCGCCGTCG TTGAGATCAA 
CCAACAACGT GTTGGCGTCG GCAAATGTGC CGNACCCGTG GATCTCGGTG ATCTTGTTCT 
TCTTCATCAG GAAGTGCACA CCGGCCACCC TGCCCTCGGN TACCTTTCGG 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 127 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:48: 
GATCCGGCGG CACGGGGGGT GCCGGCGGCA GCACCGCTGG CGCTGGCGGC AACGGCGGGG 
CCGGGGGTGG CGGCGGAACC GGTGGGTTGC TCTTCGGCAA CGGCGGTGCC GGCGGGCACG 
GGGCCGT 

(2) INFORMATION FOR SEQ ID NO: 49: 



o o 
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(2) INFORMATION FOR SEQ ID NO: 87: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 88 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:87: 

Val Gin Cys Arg Val Trp Leu Glu He Gin Trp Arg Gly Met Leu Gly 
1 5 10 15 

Ala Asp Gin Ala Arg Ala Gly Gly Pro Ala Arg He Trp Arg Glu His 
20 25 30 

Ser Met Ala Ala Met Lys Pro Arg Thr Gly Asp Gly Pro Leu Glu Ala 
35 40 45 

Thr Lys Glu Gly Arg Gly He Val Met Arg Val Pro Leu Glu Gly Gly 
50 55 60 

Gly Arg Leu Val Val Glu Leu Thr Pro Asp Glu Ala Ala Ala Leu Gly 
65 70 75 80 

Asp Glu Leu Lys Gly Val Thr Ser 
85 

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 95 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID N0:88: 

Thr Asp Ala Ala Thr Leu Ala Gin Glu Ala Gly Asn Phe Glu Arg He 
1 5 10 15 

Ser Gly Asp Leu Lys Thr Gin He Asp Gin Val Glu Ser Thr Ala Glv 
20 25 30 

Ser Leu Gin Gly Gin Trp Arg Gly Ala Ala Gly Thr Ala Ala Gin Ala 
35 40 45 

Ala Val Val Arg Phe Gin Glu Ala Ala Asn Lys Gin Lys Gin Glu Leu 
50 55 60 

Asp Glu He Ser Thr Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arq 
65 70 75 80 

Ala Asp Glu Glu Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe 
85 90 95 

(2) INFORMATION FOR SEQ ID NO:89: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 166 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:89: 

Met Thr Gin Ser Gin Thr Val Thr Val Asp Gin Gin Glu He Leu Asn 
1 5 10 15 

Arg Ala Asn Glu Val Glu Ala Pro Met Ala Asp Pro Pro Thr Asp Val 
20 25 30 

Pro He Thr Pro Cys Glu Leu Thr Xaa Xaa Lys Asn Ala Ala Gin Gin 
35 40 45 

Xaa Val Leu Ser Ala Asp Asn Met Arg Glu Tyr Leu Ala Ala Gly Ala 
50 55 60 



o -o 
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Pro Lys Ala Lys Ser His Val Thr Val Val Ala Val Leu Gly Val Leu 
100 105 no 

Gly Val Phe Leu Met Val Ser Ala Thr Phe Asn Lys Pro Ser Ala Tyr 
U5 120 125 

Ser Thr Gly Trp Ala Leu Trp Val Val Leu Ala Phe He Val Phe Gin 
130 135 140 

Ala Val Ala Ala Val Leu Ala Leu Leu Val Glu Thr Gly Ala He Thr 
145 150 155 160 

Ala Pro Ala Pro Arg Pro Lys Phe Asp Pro Tyr Gly Gin Tyr Gly Arg 
165 170 175 

Tyr Gly Gin Tyr Gly Gin Tyr Gly Val Gin Pro Gly Gly Tyr Tyr Glv 
180 185 190 

Gin Gin Gly Ala Gin Gin Ala Ala Gly Leu Gin Ser Pro Gly Pro Gin 
195 200 205 

Gin Ser Pro Gin Pro Pro Gly Tyr Gly Ser Gin Tyr Gly Gly Tyr Ser 
210 215 220 

Ser Ser Pro Ser Gin Ser Gly Ser Gly Tyr Thr Ala Gin Pro Pro Ala 
225 230 235 240 

Gin Pro Pro Ala Gin Ser Gly Ser Gin Gin Ser His Gin Gly Pro Ser 
245 250 255 

Thr Pro Pro Thr Gly Phe Pro Ser Phe Ser Pro Pro Pro Pro Val Ser 
260 265 270 

Ala Gly Thr Gly Ser Gin Ala Gly Ser Ala Pro Val Asn Tyr Ser Asn 
275 280 285 

Pro Ser Gly Gly Glu Gin Ser Ser Ser Pro Gly Gly Ala Pro Val 
290 295 300 

(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:93: 

Gly Cys Gly Glu Thr Asp Ala Ala Thr Leu Ala Gin Glu Ala Gly Asn 
1 5 10 15 

Phe Glu Arg He Ser Gly Asp Leu Lys Thr Gin He 
20 25 

(2) INFORMATION FOR SEQ ID N0:94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:94: 

Asp Gin Val Glu Ser Thr Ala Gly Ser Leu Gin Gly Gin Trp Arg Gly 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 95: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



o 



o 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 95: 

Gly Cys Gly Ser Thr Ala Gly Ser Leu Gin Gly Gin Trp Arg Gly Ala 
1 5 10 15 

Ala Gly Thr Ala Ala Gin Ala Ala Val Val Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 96: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:96: 

Gly Cys Gly Gly Thr Ala Ala Gin Ala Ala Val Val Arg Phe Gin Glu 
15 10 15 

Ala Ala Asn Lys Gin Lys Gin Glu Leu Asp Glu 
20 25 

(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:97: 

Gly Cys Gly Ala Asn Lys Gin Lys Glh Glu Leu Asp Glu He Ser Thr 
1 5 10 15 
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Asn He Arg Gin Ala Gly Val Gin Tyr Ser Arg 
20 25 

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:98: 

Gly Cys Gly He Arg Gin Ala Gly Val Gin Tyr Ser Arg Ala Asp Glu 
1 5 10 15 

Glu Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe 
20 25 



(2) INFORMATION FOR SEQ ID NO: 99: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:99: 

ATGAAGATGG TGAAATCGAT CGCCGCAGGT CTGACCGCCG CGGCTGCAAT CGGCGCCGCT 60 

GCGGCCGGTG TGACTTCGAT CATGGCTGGC GGCCCGGTCG TATACCAGAT GCAGCCGGTC 120 

GTCTTCGGCG CGCCACTGCC GTTGGACCCG GCATCCGCCC CTGACGTCCC GACCGCCGCC 180 

CAGTTGACCA GCCTGCTCAA CAGCCTCGCC GATCCCAACG TGTCGTTTGC GAACAAGGGC 240 



o 



o 
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Gin Asn Thr Pro Ala He Ala Val Asn. Glu Ala Glu Tyr Gly Glu Met 
130 135 140 

Trp Ala Gin Asp Ala Ala Ala Met Phe Gly Tyr Ala Ala Thr Ala Ala 
145 150 155 leo 

Thr Ala Thr Glu Ala Leu Leu Pro Phe Glu Asp Ala Pro Leu He Thr 
165 170 175 

Asn Pro Gly Gly Leu Leu Glu Gin Ala Val Ala Val Glu Glu Ala He 
180 185 190 

Asp Thr Ala Ala Ala Asn Gin Leu Met Asn Asn Val Pro Gin Ala Leu 
195 200 205 

Gin Gin Leu Ala Gin Pro Thr Lys Ser He Trp Pro Phe Asp Gin Leu 
210 215 220 

Ser Glu Leu Trp Lys Ala He Ser Pro His Leu Ser Pro Leu Ser Asn 
225 230 235 240 

He Val Ser Met Leu Asn Asn His Val Ser Met Thr Asn Ser Gly Val 
245 250 255 

Ser Met Ala Ser Thr Leu His Ser Met Leu Lys Gly Phe Ala Pro Ala 
260 265 270 

Ala Ala Gin Ala Val Glu Thr Ala Ala Gin Asn Gly Val Gin Ala Met 
275 280 285 

Ser Ser Leu Gly Ser Gin Leu Gly Ser Ser Leu Gly Ser Ser Gly Leu 
290 295 300 

Gly Ala Gly Val Ala Ala Asn Leu Gly Arg Ala Ala Ser Val Gly Ser 
305 310 315 320 

Leu Ser Val Pro Gin Ala Trp Ala Ala Ala Asn Gin Ala Val Thr Pro 
325 330 335 

Ala Ala Arg Ala Leu 
340 

(2) INFORMATION FOR SEQ ID NO: 112: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1256 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 

CATCGGAGGG AGTGATCACC ATGCTGTGGC ACGCAATGCC ACCGGAGNTA AATACCGCAC 60 

GGCTGATGGC CGGCGCGGGT CCGGCTCCAA TGCTTGCGGC GGCCGCGGGA TGGCAGACGC 120 

TTTCGGCGGC TCTGGACGCT CAGGCCGTCG AGTTGACCGC GCGCCTGAAC TCTCTGGGAG 180 

AAGCCTGGAC TGGAGGTGGC AGCGACAAGG CGCTTGCGGC TGCAACGCCG ATGGTGGTCT 240 

GGCTACAAAC CGCGTCAACA CAGGCCAAGA CCCGTGCGAT GCAGGCGACG GCGCAAGCCG 300 

CGGCATACAC CCAGGCCATG GCCACGACGC CGTCGCTGCC GGAGATCGCC GCCAACCACA 360 

TCACCCAGGC CGTCCTTACG GCCACCAACT TCTTCGGTAT CAACACGATC CCGATCGCGT 420 

TGACCGAGAT GGATTATTTC ATCCGTATGT GGAACCAGGC AGCCCTGGCA ATGGAGGTCT 480 

ACCAGGCCGA GACCGCGGTT AACACGCTTT TCGAGAAGCT CGAGCCGATG GCGTCGATCC 540 

TTGATCCCGG CGCGAGCCAG AGCACGACGA ACCCGATCTT CGGAATGCCC TCCCCTGGCA 600 

GCTCAACACC GGTTGGCCAG TTGCCGCCGG CGGCTACCCA GACCCTCGGC CAACTGGGTG 660 

AGATGAGCGG CCCGATGCAG CAGCTGACCC AGCCGCTGCA GCAGGTGACG TCGTTGTTCA " 720 

GCCAGGTGGG CGGCACCGGC GGCGGCAACC CAGCCGACGA GGAAGCCGCG CAGATGGGCC 780 

TGCTCGGCAC CAGTCCGCTG TCGAACCATC CGCTGGCTGG TGGATCAGGC CCCAGCGCGG 840 

GCGCGGGCCT GCTGCGCGCG GAGTCGCTAC CTGGCGCAGG TGGGTCGTTG ACCCGCACGC 900 

CGCTGATGTC TCAGCTGATC GAAAAGCCGG TTGCCCCCTC GGTGATGCCG GCGGCTGCTG 960 

CCGGATCGTC GGCGACGGGT GGCGCCGCTC CGGTGGGTGC GGGAGCGATG GGCCAGGGTG 1020 



o 
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CGCAATCCGG CGGCTCCACC AGGCCGGGTC TGGTCGCGCC GGCACCGCTC GCGCAGGAGC 1080 

GTGAAGAAGA CGACGAGGAC GACTGGGACG MGAGGACGA CTGGTGAGCT CCCGTAATGA 1140 

CAACAGACTT CCCGGCCACC CGGGCCGGAA GACTTGCCAA CATTTTGGCG AGGAAGGTAA 1200 

AGAGAGAAAG TAGTCCAGCA TGGCAGAGAT GAAGACCGAT GCCGCTACCC TCGCGC 1256 
(2) INFORMATION FOR SEQ ID NO: 113: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:113: 

CTAGTGGATG GGACCATGGC CATTTTCTGC AGTCTCACTG CCTTCTGTGT TGACATTTTG 60 

GCACGCCGGC GGAAACGAAG CACTGGGGTC GAAGAACGGC TGCGCTGCCA TATCGTCCGG 120 

AGCTTCCATA CCTTCGTGCG GCCGGAAGAG CTTGTCGTAG TCGGCCGCCA TGACAACCTC 180 

TCAGAGTGCG CTCAAACGTA TAAACACGAG AAAGGGCGAG ACCGACGGAA GGTCGAACTC 240 

GCCCGATCCC GTGTTTCGCT ATTCTACGCG AACTCGGCGT TGCCCTATGC GAACATCCCA 300 

GTGACGTTGC CTTCGGTCGA AGCCATTGCC TGACCGGCTT CGCTGATCGT CCGCGCCAGG 360 

TTCTGCAGCG CGTTGTTCAG CTCGGTAGCC GTGGCGTCCC ATTTTTGCTG GACACCCTGG 420 

TACGCCTCCG AA 432 
(2) INFORMATION FOR SEQ ID NO: 114: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 368 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114: 

Met Leu Trp His Ala Met Pro Pro Glu Xaa Asn Thr Ala Arg Leu Met 
1 5 10 15 

Ala Gly Ala Gly Pro Ala Pro Met Leu Ala Ala Ala Ala Gly Trp Gin 
20 25 30 

Thr Leu Ser Ala Ala Leu Asp Ala Gin Ala Val Glu Leu Thr Ala Arg 
35 40 45 

Leu Asn Ser Leu Gly Glu Ala Trp Thr Gly Gly Gly Ser Asp Lys Ala 
50 55 60 

Leu Ala Ala Ala Thr Pro Met Val Val Trp Leu Gin Thr Ala Ser Thr 
65 70 75 80 

Gin Ala Lys Thr Arg Ala Met Gin Ala Thr Ala Gin Ala Ala Ala Tyr 
85 90 95 

Thr Gin Ala Met Ala Thr Thr Pro Ser Leu Pro Glu He Ala Ala Asn 
100 105 110 

His He Thr Gin Ala Val Leu Thr Ala Thr Asn Phe Phe Gly He Asn 
115 120 125 

Thr He Pro He Ala Leu Thr Glu Met Asp Tyr Phe He Arg Met Trp 
130 135 140 

Asn Gin Ala Ala Leu Ala Met Glu Val Tyr Gin Ala Glu Thr Ala Val 
145 150 155 160 

Asn Thr Leu Phe Glu Lys Leu Glu Pro Met Ala Ser He Leu Asp Pro 
165 170 175 

Gly Ala Ser Gin Ser Thr Thr Asn Pro He Phe Gly Met Pro Ser Pro 
180 185 190 

Gly Ser Ser Thr Pro Val Gly Gin Leu Pro Pro Ala Ala Thr Gin Thr 
195 200 205 
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Leu Gly Gin Leu Gly Glu Met Ser Gly Pro Met Gin Gin Leu Thr Gin 
210 215 220 

Pro Leu Gin Gin Val Thr Ser Leu Phe Ser Gin Val Gly Gly Thr Gly 
225 230 235 240 

Gly Gly Asn Pro Ala Asp Glu Glu Ala Ala Gin Met Gly Leu Leu Gly 
245 250 255 

Thr Ser Pro Leu Ser Asn His Pro Leu Ala Gly Gly Ser Gly Pro Ser 
260 265 270 

Ala Gly Ala Gly Leu Leu Arg Ala Glu Ser Leu Pro Gly Ala Gly Gly 
275 280 285 



Ser Leu Thr Arg Thr Pro Leu Met Ser Gin Leu He Glu Lys Pro Val 
290 295 300 

Ala Pro Ser Val Met Pro Ala Ala Ala Ala Gly Ser Ser Ala Thr Gly 
305 310 315 320 

Gly Ala Ala Pro Val Gly Ala Gly Ala Met Gly Gin Gly Ala Gin Ser 
325 330 335 

Gly Gly Ser Thr Arg Pro Gly Leu Val Ala Pro Ala Pro Leu Ala Gin 
340 345 350 

Glu Arg Glu Glu Asp Asp Glu Asp Asp Trp Asp Glu Glu Asp Asp Trp 
355 360 365 



(2) INFORMATION FOR SEQ ID NO: 115: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115: 
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Met Ala Glu Met Lys Thr Asp Ala Ala Thr Leu Ala 
1 5 io 

(2) INFORMATION FOR SEQ ID NO: 116: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:116: 



GATCTCCGGC GACCTGAAAA CCCAGATCGA CCAGGTGGAG 


TCGACGGCAG GTTCGTTGCA 


60 


GGGCCAGTGG CGCGGCGCGG CGGGGACGGC CGCCCAGGCC 


GCGGTGGTGC GCTTCCAAGA 


120 


AGCAGCCAAT AAGCAGAAGC AGGAACTCGA CGAGATCTCG 


ACGAATATTC GTCAGGCCGG 


180 


CGTCCAATAC TCGAGGGCCG ACGAGGAGCA GCAGCAGGCG 


CTGTCCTCGC AAATGGGCTT 


240 


CTGACCCGCT AATACGAAAA GAAACGGAGC AAAAACATGA 


CAGAGCAGCA GTGGAATTTC 


300 


GCGGGTATCG AGGCCGCGGC AAGCGCAATC CAGGGAAATG 


TCACGTCCAT TCATTCCCTC 


360 


CTTGACGAGG GGAAGCAGTC CCTGACCAAG CTCGCA 




396 


(2) INFORMATION FOR SEQ ID NO: 117: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117: 
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He Ser Gly Asp Leu Lys Thr Gin He Asp Gin Val Glu Ser Thr Ala 
15 10 15 

Gly Ser Leu Gin Gly Gin Trp Arg Gly Ala Ala Gly Thr Ala Ala Gin 
20 25 30 

Ala Ala Val Val Arg Phe Gin Glu Ala Ala Asn Lys Gin Lys Gin Glu 
35 40 45 

Leu Asp Glu He Ser Thr Asn He Arg Gin Ala Gly Val Gin Tyr Ser 
50 55 60 

Arg Ala Asp Glu Glu Gin Gin Gin Ala Leu Ser Ser Gin Met Gly Phe 
65 70 75 80 



(2) INFORMATION FOR SEQ ID NO: 118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 387 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

GTGGATCCCG ATCCCGTGTT TCGCTATTCT ACGCGAACTC GGCGTTGCCC TATGCGAACA 60 

TCCCAGTGAC GTTGCCTTCG GTCGAAGCCA TTGCCTGACC GGCTTCGCTG ATCGTCCGCG 120 

CCAGGTTCTG CAGCGCGTTG TTCAGCTCGG TAGCCGTGGC GTCCCATTTT TGCTGGACAC 180 

CCTGGTACGC CTCCGAACCG CTACCGCCCC AGGCCGCTGC GAGCTTGGTC AGGGACTGCT 240 

TCCCCTCGTC AAGGAGGGAA TGAATGGACG TGACATTTCC CTGGATTGCG CTTGCCGCGG 300 

CCTCGATACC CGCGAAATTC CACTGCTGCT CTGTCATGTT TTTGCTCCGT TTCTTTTCGT 360 

ATTAGCGGGT CAGAAGCCCA TTTGCGA 387 
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(2) INFORMATION FOR SEQ ID NO: 119: 

(1) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 272 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119: 
CGGCACGAGG ATCTCGGTTG GCCCAACGGC GCTGGCGAGG GCTCCGTTCC GGGGGCGAGC 
TGCGCGCCGG ATGCTTCCTC TGCCCGCAGC CGCGCCTGGA TGGATGGACC AGTTGCTACC 
TTCCCGACGT TTCGTTCGGT GTCTGTGCGA TAGCGGTGAC CCCGGCGCGC ACGTCGGGAG 
TGTTGGGGGG CAGGCCGGGT CGGTGGTTCG GCCGGGGACG CAGACGGTCT GGACGGAACG 
GGCGGGGGTT CGCCGATTGG CATCTTTGCC CA 
(2) INFORMATION FOR SEQ ID NO: 120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:120: 

Asp Pro Val Asp Ala Val He Asn Thr Thr Cys Asn Tyr Gly Gin Val 
1 5 10 15 

Val Ala Ala Leu 
20 

(2) INFORMATION FOR SEQ ID NO: 121: 
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Claims 

1. A polypeptide comprising an immunogenic portion of a soluble 
M. tuberculosis antigen, or a variant of said antigen that differs only in conservative 
substitutions and/or modifications, wherein said antigen has an N-terminal sequence selected 
from the group consisting of: 

(a) Asp-Pro-Val-Asp-Ala-Val-Ile-Asn-Thr-Thr-Cys-Asn-Tyr-Gly-Gln- 
Val-Val-Ala-Ala-Leu; (SEQ ID No. 120) 

(b) Ala-Val-Glu-Ser-Gly-Met-Leu-Ala-Leu-Gly-Thr-Pro-Ala-Pro-Ser; 
(SEQ ID No. 121) 

(c) Ala-Ala-Met-Lys-Pro-Arg-Thr-Gly-Asp-Gly-Pro-Leu-Glu-Ala-Ala- 
Lys-Glu-Gly-Arg; (SEQ ID No. 122) 

(d) Tyr-Tyr-Trp-Cys-Pro-GIy-Gln-Pro-Phe-Asp-Pro-Ala-Trp-Gly-Pro; 
(SEQ ID No. 123) 

(e) Asp-Ile-Gly-Ser-Glu-Ser-Thr-Glu-Asp-Gln-Gln-Xaa-Ala-Val; (SEQ 
ID No. 124) 

(£) Ala-Glu-Glu-Ser-Ile-Ser-Thr-Xaa-Glu-Xaa-Ile-Val-Pro; (SEQ ID No. 
125) 

(g) Asp-Pro-Glu-Pro-Ala-Pro-Pro-Val-Pro-Thr-Thr-Ala-Ala-Ser-Pro-Pro- 
Ser; (SEQ ID No. 126) 

(h) Ala-Pro-Lys-Thr-Tyr-Xaa-Glu-Glu-Leu-Lys-Gly-Thr-Asp-Thr-Gly; 
(SEQ ID No. 127) 

(i) Asp-Pro-Ala-Ser-Ala-Pro-Asp-Val-Pro-Thr-Ala-Ala-Gln-Leu-Thr-Ser- 
Leu-Leu-Asn-Ser-Leu-Ala-Asp-Pro-Asn-Val-Ser-Phe-Ala-Asn; (SEQ 
ID No. 128) and 

0) Ala-Pro-Glu-Ser-Gly-Ala-Gly-Leu-Gly-GIy-Thr-Val-Gln-Ala-Gly; 
(SEQ ID No. 136) 
wherein Xaa may be any amino acid. 



2. A polypeptide comprising an immunogenic portion of an 
M. tuberculosis antigen, or a variant of said antigen that differs only in conservative 
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substitutions and/or modifications, wherein said antigen has an N-tenninal sequence selected 
from the group consisting of: 

(a) Asp-Pro-Pro-Asp-Pro-His-Gln-Xaa-Asp-Met-Thr-Lys-Gly-Tyr-Tyr- 
Pro-Gly-Gly-Arg-Arg-Xaa-Phe; (SEQ ID No. 129) and 

(b) Xaa-Tyr-Ile-Ala-Tyr-Xaa-Thr-Thr-Ala-Gly-Ile-Val-Pro-Gly-Lys-Ile- 
Asn-Val-His-Leu-Val; (SEQ ID No. 137), wherein Xaa may be any 
amino acid. 



3. A polypeptide comprising an immunogenic portion of a soluble 
M. tuberculosis antigen, or a variant of said antigen that differs only in conservative 
substitutions and/or modifications, wherein said antigen comprises an amino acid sequence 
encoded by a DNA sequence selected from the group consisting of the sequences recited in 
SEQ ID Nos.: 1, 2, 4-10, 13-25, 52, 99 and 101, the complements of said sequences, and 
DNA sequences that hybridize to a sequence recited in SEQ ID Nos.: 1, 2, 4-10, 13-25, 52, 
99 and 101 or a complement thereof under moderately stringent conditions. 

4. A polypeptide comprising an immunogenic portion of a 
M tuberculosis antigen, or a variant of said antigen that differs only in conservative 
substitutions and/or modifications, wherein said antigen comprises an amino acid sequence 
encoded by a DNA sequence selected from the group consisting of the sequences recited in 
SEQ ID Nos.: 26-51, the complements of said sequences, and DNA sequences that hybridize 
to a sequence recited in SEQ ID Nos.: 26-51 or a complement thereof under moderately 
stringent conditions. 

5. A DNA molecule comprising a nucleotide sequence encoding a 
polypeptide according to any one of claims 1-4. 

6. An expression vector comprising a DNA molecule according to claim 

5. 

7. A host cell transformed with an expression vector according to claim 6. 
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8. The host cell of claim 7 wherein the host cell is selected from the group 
consisting of K coli, yeast and mammalian cells. 

9. A pharmaceutical composition comprising one or more polypeptides 
according to any one of claims 1 -4 and a physiologically acceptable carrier. 

10. A pharmaceutical composition comprising one or more DNA 
molecules according to claim 5 and a physiologically acceptable carrier. 

11. A pharmaceutical composition comprising one or more DNA 
sequences recited in SEQ ID Nos.: 3, 1 1 and 12; and a physiologically acceptable carrier. 

12. A vaccine comprising one or more polypeptides according to any one 
of claims 1-4 and a non-specific immune response enhancer. 

13. A vaccine comprising: 

a polypeptide having an N-terminal sequence selected from the group 
consisting of sequences recited in SEQ ID NO: 134 and 135; and 
a non-specific immune response enhancer. 

14. A vaccine comprising: 

one or more polypeptides encoded by a DNA sequence selected from the 
group consisting of SEQ ID Nos.: 3, 1 1 and 12, the complements of said sequences, and 
DNA sequences that hybridize to a sequence recited in SEQ ID Nos.: 3, 1 1 and 12; and 

a non-specific immune response enhancer. 

15. The vaccine of claims 12-14 wherein the non-specific immune 
response enhancer is an adjuvant. 



16. A vaccine comprising one or more DNA molecules according to claim 
5 and a non-specific immune response enhancer. 
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17. A vaccine comprising one or more DNA sequences recited in SEQ ED 
Nos.: 3,11 and 12; and a non-specific immune response enhancer. 

18. The vaccine of claims 16 or 17 wherein the non-specific immune 
response enhancer is an adjuvant 

19. A method for inducing protective immunity in a patient, comprising 
administering to a patient a pharmaceutical composition according to any one of claims 9-11. 

20. A method for inducing protective immunity in a patient, comprising 
administering to a patient a vaccine according to any one of claims 12-18. 

21. A fusion protein comprising two or more polypeptides according to 
any one of claims 1-4. 

22. A fusion protein comprising one or more polypeptides according to 
any one of claims 1-4 and ESAT-6. 

23. A pharmaceutical composition comprising a fusion protein according 
to claim 21 or 22 and a physiologically acceptable carrier. 

24. A vaccine comprising a fusion protein according to claims 21 or 22 and 
a non-specific immune response enhancer. 

25. The vaccine of claim 24 wherein the non-specific immune response 
enhancer is an adjuvant. 

26. A method for inducing protective immunity in a patient, comprising 
administering to a patient a pharmaceutical composition according to claim 23. 



27. A method for inducing protective immunity in a patient, comprising 
administering to a patient a vaccine according to claims 24 or 25. 



WO 97/09428 



PCT/US96/14674 



162 

28. A method for detecting tuberculosis in a patient, comprising: 

(a) contacting dermal cells of a patient with one or more polypeptides 
according to any one of claims 1-4; and 

(b) detecting an immune response on the patient's skin and therefrom 
detecting tuberculosis in the patient. 

29. A method for detecting tuberculosis in a patient, comprising: 

(a) contacting dermal cells of a patient with a polypeptide having an N- 
tenninal sequence selected from the group consisting of sequences recited in SEQ ID NO: 
134 and 135; and 

(b) detecting an immune response on the patient's skin and therefrom 
detecting tuberculosis in the patient 

30. A method for detecting tuberculosis in a patient, comprising: 

(a) contacting dermal cells of a patient with one or more polypeptides 
encoded by a DNA sequence selected from the group consisting of SEQ ID Nos.: 3, 11 and 
12, the complements of said sequences, and DNA sequences that hybridize to a sequence 
recited in SEQ ID Nos,: 3, 1 1 and 12; and 

(b) detecting an immune response on the patient's skin and therefrom 
detecting tuberculosis in the patient. 

31. The method of any one of claims 28-30 wherein the immune response 

is induration. 

32. A diagnostic kit comprising: 

(a) a polypeptide according to any one of claims 1-4; and 

(b) apparatus sufficient to contact said polypeptide with the dermal cells of 

a patient. 
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33. A diagnostic kit comprising: 

(a) a polypeptide having an N-tenninal sequence selected from the group 
consisting of sequences recited in SEQ ID NO: 134 and 135; and 

(b) apparatus sufficient to contact said polypeptide with the dermal cells of 

a patient 

34. A diagnostic kit comprising: 

(a) a polypeptide encoded by a DNA sequence selected from the group 
consisting of SEQ ID Nos.: 3, 11 and 12, the complements of said sequences, and DNA 
sequences that hybridize to a sequence recited in SEQ ID Nos.: 3, 1 1 and 12; and 

(b) apparatus sufficient to contact said polypeptide with the dermal cells of 

a patient. 
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